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ABSTRACT 

We present a catalog of emission-line galaxies selected solely by their emission-line fluxes using a wide-field 
integral field spectrograph. This work is partially motivated as a pilot survey for the upcoming Hobby-Eberly 
Telescope Dark Energy Experiment (HETDEX). We describe the observations, reductions, detections, redshift 
classifications, line fluxes, and counterpart information for 397 emission-line galaxies detected over 169 □' 
with a 3500-5800A bandpass under 5A full-width-half-maximum (FWHM) spectral resolution. The survey's 
best sensitivity for unresolved objects under photometric conditions is between 4 — 20 x 10 -17 erg s _1 cm -2 
depending on the wavelength, and Lya luminosities between 3 — 6 x 10 42 erg s" 1 are detectable. This survey 
method complements narrowband and color-selection techniques in the search for high redshift galaxies with its 
different selection properties and large volume probed. The four survey fields within the COSMOS, GOODS-N, 
MUNICS, and XMM-LSS areas are rich with existing, complementary data. We find 104 galaxies via their high 
redshift Lya emission at 1.9 < z < 3.8, and the majority of the remainder objects are low redshift [OII]3727 
emitters at z < 0.56. The classification between low and high redshift objects depends on rest frame equivalent 
width, as well as other indicators, where available. Based on matches to X-ray catalogs, the active galactic 
nuclei (AGN) fraction amongst the Lya emitters (LAEs) is 6%. We also analyze the survey's completeness 
and contamination properties through simulations. We find five high-z, highly-significant, resolved objects 
with full-width-half-maximum sizes > 44 □" which appear to be extended Lya nebulae. We also find three 
high-z objects with rest frame Lya equivalent widths above the level believed to be achievable with normal star 
formation, EWo > 240A. Future papers will investigate the physical properties of this sample. 

Subject headings: galaxies: formation — galaxies: evolution — galaxies: high-redshift — cosmology: obser- 
vations 
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The Hobby-Eberl y Telescope (HET) Dar k Energy Exper- 
iment (HETDEX) (iHill etal.l l200l l2008al) will survey 60 
□° spread throughout 420 D° to discover 0.8 million new 
Lyman-a emitting galaxies (LAEs) over 1.9 < z < 3.5 
and use them to map the expansion history of the universe. 
A further ~4 million low-z galaxies will have their red- 
shifts determined, primarily in the [Oil] 3727 transition, over 
< z < 0.47. The primary HETDEX science goal is to 
measure the dark energy equation of state at high redshift 
by using the three-di mensional power spectrum of LAE po- 
sitions and redshifts dJeong & Komatsu 2006; Koehl eret al.l 
120071: Ueong & Komatsull2009t IShoji et al.ll2009l) . An impor- 
tant secondary goal of HETDEX is to investigate the phys- 
ical properties of star forming galaxies, through Lya and 
[Oil] emission, using vastly greater statistics and volumes 
than currently available. The survey will use an array of 
150 integral field spectrographs on the upgraded 10 m HET 
dRamsev et al.ll998HSav"age et al.l2010l) called the Visible In- 
tegral field Replicable Unit Spectrograph (VIRUS; iHill et al.l 
120101) . 

The HETDEX Pilot Survey (HPS) is the pathfinder to the 
full HETDEX survey. This pilot survey provides a direct 
test of equipment, data reduction, target properties, observ- 
ing procedures, and ancillary data requirements to HETDEX 
by using one integral field spectrograp h, named the VIRUS 
prototype (VIRUS-P; IHill et al.l [2008M) , on the 2.7 m Har- 
lan J. Smith telescope at the McDonald Observatory over 111 
nights. To do this, the pilot survey uses the novel technique 
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of blind, field-of-view, wide-field contiguous spectroscopy 
to find emission line objects over a broad redshift range. 
While large numbers of narrowband-s elected LAEs have been 
assembled by prev i ous surveys (e.g. iHu & McMahonl 1 1996 b 
Cowie & Hul fl998t iRhoads et al.l 12000: iSteidel et alTpOOOt 
Ouchi et al.1 120031: iHuet al.l 12004 lHavashino et ail 12004 



Santos et al.ll2004t lPalunas et al.ll2004t IVenemans et al.|2005[ 
Gawiser et al.l2006HGronwall et al.l2007l: | Nilsson et al.l 2007]: 
Ouchi et al l 120081: iNilsson et al.l l2009TlGuaita et al.l l20ld 



Tilvi et al. 12010) . these surveys are heterogeneous in na 



ture, with different depths and equivalent width (EW) lim- 
its. The HPS is designed to produce a homogeneous sam- 
ple of LAEs over an extremely large volume, 1.03 xlO 6 
Mpc 3 h 70 3 , that is nearly an order of magnitude larger than 
the largest existing blind spectroscopic survey, 2.5 xlO 5 
Mpc 3 hf 3 ( iCassata et all 120 10), and vastly larger than other 
blind su rveys dPirzkal et : dj|2004t Ivan Breukelen et al J 120051: 



IXu et al.ll2007h iSawicki et al.ll2008b iMartin et al.ll2008l) . This 
allows us to evaluate potential redshift evolution of LAE prop- 
erties and to make comp arisons to color-selected high redshift 
galaxy populations (e.g. l Steidel et al.ll 9961 [1999: Dad di et al.l 
l200l iKornei et al.1 12010). The HPS also enables us to find a 
large sample of lower redshift galaxies selected through, pri- 
marily, their [OII]3727, H/3, and [OIII] emission and study 
their properties over a lower redshift ranges (up to z =0.56, 
0.19, 0.17, and 0.16 for [Oil], H/3, [OIII]4959, and [OIII]5007 
respectively). 

The paper is organized as follows. In £)2.1| we describe the 
instrumental capabilities of VIRUS-P, the type and quality of 
data taken, the necessary calibrations, and the imaging com- 
piled to aid source classification. We detail the data reduction 
steps, with spe cial care given toward tracking systematic er- 
rors in S|3] In £|4.1| we describe the methods used to recover 
objects to the survey's statistical limits and analyze the effect 
of noise contamination and the emission-line flux measure- 
ments. In S|5] we present our classification methods, relying 
primarily on imaging counterpart likelihoods and equivalent 
width measurements. The contamination of the high redshift 
LAE sample by active galactic nuclei (AGN) is presented as 
well as example classifications. The final emission-line cat- 
alog and its summary properties are given in ||6] Finally, in 
SjT] we review the analysis and describe its place in future 
projects. 

In this work, we adopt a standard ACDM cosmology with 
H o =70 km s- 1 Mpc~\ ft M =0.3, and fl A =0.7. All mag - 
nitudes are quoted in the AB system dOke & Gunnl 119831) . 
All wavelengths are corrected to vacuum conditions in the 
heliocentric frame with an assumed wavelength-independent 
index of refraction for air at the observatory's altitude of 
n = 1.00022. 

2. OBSERVATIONS 

2.1. Instrumental configuration 

The Visible Integral-field Replicable Unit Spectrograph 
Prototype (VI RUS-P) was desig ned for this pilot survey and 
is described in lHill et"ail (12008b!) and references therein. The 
instrument is a fiber-based Integral Field Spectrograph (IFS) 
fed at f/3.65 on the McDonald Observatory's 2.7m Harlan J. 
Smith telescope. A small focal reducer sits just prior to the In- 
tegral Field Unit (IFU) input in the lightpath of the telescope's 
f/8.8 focus. Originally, VIRUS-P operation used a focal re- 
ducer labeled FR1, but all data taken after September of 2008 
used a second focal reducer labeled FR2, which has signif- 



icant ly im proved efficiency below 4000A compared to FR1 
(see £|2.41 i. Auto-guiding and sky transparency measurements 
were performed with an off-the-shelf Apogee Alta camera in- 
stalled into a field position ~ 9' north of the IFS field of view 
(FOV). The guiderhas a square 20.25D' FOV and uses a B+V 
filter with a mean wavelength of 5000A at a platescale of 0753 
per pixel. 

Two different IFUs have been used over the course of this 
pilot survey. Fiber bundle IFU-1, used prior to March 2008, 
spans 1.'70 x 1.'77 with 244 functional and 3 broken 200 fim 
core diameter (4'.'235 on-sky) fibers. IFU-2 spans 1 '61 x L65 
with 246 functional and broken fibers of the same core size. 
There is no significant difference in throughpu t between the 
bundl es. Both IFUs are of the densepak type dBarden et al.l 
119981) with a filling factor near 1/3, requiring at least three 
dithered positions to fully sample the FOV. This survey uti- 
lizes a six position dithering pattern as illustrated in Figure Q] 
The nearly x2 oversampling delivered by this dithering pat- 
tern provides improved spatial registration between detected 
spectral objects and imaging-based continuum counterparts. 
The wavelength range on VIRUS-P is adjustable from 3400- 
6800A and a set of volume phase holographic gratings deliv- 
ering various spectral resolutions are available. For this sur- 
vey the instrument was set to cover 3500-5800A at resolutions 
that range from 4.5-5.5A full width half maximum (FWHM) 
over the whole dataset through a 831 lines mm -1 grating that 
delivers a dispersion of 1.1 A pixel -1 in the unbinned charge- 
coupled device (CCD) mode. The spectral resolution over that 
range weakly and gradually varies with wavelength and be- 
tween different fibers due to CCD surface shape deviations 
from planarity, camera design limits, and the residual camera 
alignment errors. The data are recorded on a 2kx2k CCD 
with 15 /j,m pi xels in a custom built, LN2 cooled, vacuum- 
sealed camera dTufts et al. 2008) with electronics that deliver 
between 3.6-4.2 e~ read noise, making the sky background 
the dominant source of noise at all wavelengths in our 20 
minute exposures. The data have been taken with 2x1 bin- 
ning along the dispersion direction to minimize read noise and 
still maintain a Nyquist sampling of the instrumental line pro- 
file. 

Several instrumental properties determine the survey's cal- 
ibration needs. The ins trument's sca t tered light properties 
have been discussed in lAdams et al.l (120081) . A weak in- 
focus ghost of atmospheric OH lines redder than the tar- 
geted wavelength range was found to exist at discrete wave- 
lengths. These lines are easily distinguished by their devia- 
tions from calibrated wavelength solutions and fiber trace po- 
sitions. The strength of the scattered light varied over time 
as alignments changed and baffling was implemented, but the 
ghost's strength was at maximum 3x the resolution element 
noise, and more characteristically below the noise in any one 
fiber. The scattered light affected one resolution element per 
fiber. Extra masking installed around the grating solved this 
issue for all data taken after September 2008. All emission- 
line sources discussed in this paper from observations prior 
to the installation of the grating mask have been visually in- 
spected to not lie in the affected regions. 

The lab testing and characterization of the VIRUS-P fibers, 
with particular attention to trans mission and focal rat io degra- 
dation, has been investigated in lMurphv et al.l d2008l) . A high 
stability in each fiber's throughput over a night, at minimum, 
is crucial toward the survey's goals. IFU mounting practices 
have been established from these tests to yield fiber stabil- 
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ity sufficient for our purposes. To facilitate mounting on the 
HET as well as the Smith telescope, the IFU was made longer 
than otherwise necessary. Since the IFU demonstrated infe- 
rior performance when coiled, the fibers were left uncoiled 
for most of this pilot survey. When the IFU bundle is properly 
uncoiled, it is measured on-telescope to be stable over nightly 
operating conditions to 1 % root-mean-squared (rms) for the 
most affected fibers and 0.3% rms for the median fiber. We 
wil l exp lore the effect of this potential systematic on the data 
in i)3.6l There, we will show that the VIRUS-P fiber stability 
is not an important issue for emission-line detections, but can 
dominate the uncertainty in continuum estimates. 

The mechanical design of VIRUS-P has been presented in 
ISmith et ail ( 120081) . The instrument's mechanical structures 
are all made from aluminium to achieve a uniform coefficient 
of thermal expansion between components and to maintain 
the optical alignment. The gimbal mount connecting VIRUS- 
P to the telescope allows VIRUS-P to swing into a horizon- 
tal position for any pointing of the equatorially-mounted tele- 
scope. This ensures that the trace patterns of fibers on the 
CCD remains constant to high precision over a night. Al- 
though a < 0.05 pixel trace shift per night is desired, this 
could not always be accomplished. A trace could shift by up 
to 0.3 pixels with temperature under some operating condi- 
tions. Consequently, data reduction steps were developed to 
identify and compensate for this subtle systematic; these are 
described in Sj3] There is not an atmospheric differential cor- 
rector installed on the telescope. We discuss the atmopheric 
effects on emission-line source astrometry i n Ap pendixlAland 
the absolute flux calibration of the data in i)2.4| All observa- 
tions were taken with airmasses below two. 

2.2. Data collection 

We obtained regular fall/winter/spring dark time observa- 
tions from September 2007 to February 2010 on the Mc- 
Donald 2.7m Harlan J. Smith telescope. These observing 
runs are summarized in Table Q] In total, out of our allo- 
cation of 113 nights, 61 were useful for this project. We 
constructed datacube mosaics in four sc ience fields: the Cos - 
mological Evolution Survey (COSMOS :lScoville et al.l2007l) 
the Hubble Deep Field North (HDFN; iWilliams et al.1l!996l) 
and the surroundin g Great Observatories O rigins Deep Survey 
North (GOODS-N lDickinson et~ai1l2003l). the Mun ich Near- 
IR Cluster Survey (MUNICS: iDrorv et alj 12001b, and the 
XMM Large Scale Structure field (XMM-LSS: iPierre et al.l 
120041) . We completed 27, 13, 16, and 4 field pointings, respec- 
tively in these fields, by taking three 20-minute exposures at 
each of the 6 dither positions. Our effective observation area, 
accounting for mosaic overlap, is 169.23 □' over the wave- 
lengths ^3500-5800A with a spectral resolution of ~5 A. This 
corresponds to survey volumes of 1.03 xlO 6 Mpc 3 hy 3 for 
LAEs and 4.24 xlO 4 Mpc 3 hf 3 for [Oil] sources. As de- 
scribed in [23] and shown in Figure we give the survey's 
flux and luminosity limits as a function of wavelength under 
photometric conditions for the case of a spectrally unresolved, 
point source emission-line object well centered on a fiber. 

In addition to the science data, the following calibration 
data were obtained one or twice each n i ght. S pectrophotomet- 
ric standard stars from iMassey et al.l (119881) were observed. 
Flats near zenith of the dawn and dusk sky were taken. Cal- 
ibration with dome lamps was explored but abandoned when 
none were found with sufficient blue-to-red flux balance. Sets 
of bias frames were taken and used to construct a master 



bias for each run. HgCd arc lamps were used to illuminate a 
dome screen for wavelength calibration. Custom line lists for 
the HgCd lamps were made by o bserving the lam ps with the 
2.7m's Tull Coude Spectrograph dTull et al.ll 19951) at R=60k. 
The Coude wavelength calibration was made from ThAr lines. 
For most of the observing runs, guider frames were saved at 
intervals of 2-10 seconds, depending on the guider star bright- 
ness and transparency. The collection of guider frames was 
prevented 13% of the time due to human error and guider 
equipment failure. For those observations, the flux calibration 
was done assu ming the median of the observed atmospheric 
transmission ( ^2.4| i from the dataset's remaining observations. 

2.3. Astrometry 

The position of a faint source is not well determined by the 
IFS data alone since most pointings lack sufficiently bright 
stars to establish an astro me trie solution for the frame. In- 
stead, the positions of stars in the offset guider camera were 
used to determine the fiber positions; this required precise 
calibration of the relative astrometry between the fiber array 
and the offset guider. The relative fiber-to-fiber positions of 
both IFUs were measured in the laboratory and verified to be 
very regular due to the precise machining. Illumination and 
direct imaging in the lab showed that IFU-2 has exceptional 
uniformity in its fiber matrix, and no deviations from the de- 
signed pitch of 340 jum could be measured to an accuracy of 
1 /xm. IFU-1 is somewhat less uniformity in its fiber matrix 
than IFU-2. We have mapped the centroid of each fiber to 
within 0.3/im, or 07007, at the nominal plate scale. 

The transformation from guider field position to science 
field position was calibrated by on-sky measurements. When- 
ever the guide camera was replaced, we obtained data under 
a six dither pattern on open clusters at low airmass. In total, 
seven astrometric solutions were derived, each yielding the 
plate scales, offsets, and rota tions of two image planes un der a 
standard tangent projection dGreisen & Calab retta 19931). We 
found a adequate fits with constant plate scales determined 
for each IFU axis yielding twelve degrees of freedom in a 
non-linear transformation from guider and IFS pixel posi- 
tions to celestial coordinate s. We first determined gu ider po- 
sitions by using SExtractor dBertin & Arnoutsll 19961) to mea- 
sure the positions of stars and match to coordinates from the 
United States Naval O bservatory's (USNO) Nomad catalog 
dZacharias et al.l 120051) . Similarly, the continuum intensities 
of USNO stars in the fibers were measured by summing flux 
over the wavelength range 4100A< A <5700A; this region 
was chosen to mimic the guider wavelength response and 
minimize atmospheric refraction differences. Fibers contain- 
ing signal significantly above the noise were matched with 
significant detections in adjacent fibers. Centroids were cal- 
culated for each source a nd again matched to the Nomad cat- 
alog. A simplex method (Pr ess et al.ll 19921) was then used to 
find the least squares minimum robustly in the presence of the 
many local minima. We show in Figure Q] the fit quality in a 
representative solution. The range of systematic uncertainty 
in our seven eras of astrometric solutions was 0717-0751 with 
a median of 0731. 

We further measured the stability of the astrometry over 
many months from flux standard stars. We anticipated any 
drift to be negligible due to the design of plastic pins which 
located the IFU head against the telescope mounting surface. 
However, we found substantial month-to-month systematic 
variations of order l'/8 rms. The only clear dependence was 
a declination term with temperature, which we attribute to 
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a thermal expansion of the guider camera mount. However, 
this expansion cannot explain the bulk of the astrometric scat- 
ter. Since we fin much smaller astrometric scatter in any one 
month, the monthly removal and remounting of the IFU in- 
put head from the telescope between observing runs is the 
plausible source of drift. So, we have chosen to estimate an 
empirical month-by-month offset in the astrometric zeropoint 
which lowers the median monthly rms to 076 and ranges from 
O'.'O-l'.'O. 

Coarse positional sampling by the large fibers and low S/N 
limitations forms the final component of the astrometric error 
budget. In order to quantify this uncertainty, we have sim- 
ulated the positional recovery for a range of em ission line 
sources. We describe those simulations in ^4.31 The result 
is a fit to the random astrometric uncertainty with a functional 
form of a r , random = 0'.'348 + 2! 04/ (S/N). 

We can assess the completeness of our error budget by mea- 
suring the observed positional offsets of emission-line objects 
found with high confidence counterparts. As explained in $5] 
a comparison of our fiber detections with broadband imaging 
shows that 55% of our emission-line detections have an iso- 
lated counterpart detected with > 90% confidence. Through 
a comparison, we find a mean offset of Aa = — C/53 ± 0'.'05 
and AS = 0'.'39 ± 0'.'05 between the fiber-based emission- 
line source positions and the broadband photometric centers. 
The source of this offset is not certain, but we apply it to all 
our reported emission-line positions. After correcting for this 
offset, the counterpart associations were iterated to produce 
our final emission-line positions. In Figure [3] we present the 
distribution of the data offsets to test the error budget. This 
error budget serves as an important input in the method (fj5) 
for assigning broadband counterparts in crowded fields to the 
emission-line sources. 

2.4. Flux calibration and transparency 

The majority of the observations were not taken under pho- 
tometric conditions, hence a proper flux calibration requires a 
realtime measurement of the atmospheric transparency. Un- 
like some modern wide-field imagers, the VIRUS-P field of 
view is not large enough to contain photometrically calibrated 
stars in the majority of its arbitrary pointings. However, 
the offset guider with a larger field of view has a size suf- 
ficient for this continuous calibration purpose. We recorded 
all guide camera exposures sampled at 2-10 seconds that 
were contemporary with the IFS science exposures. The 
guider exposure times varied depending on the guide star 
brightness. Basic bias-subtraction and fiat-fielding reduc- 
tions were implemented on the guider frames. We performed 
aperture photometry on all stars detected. When available, 
we used Sloan Digital Sky Su rvey (SDSS) measurements 
dAdelman-McCarthv et al.l [20081) for our calibrations ; other - 
wise we used the USNO-B1.0 survey dMonet et al.l 120031) . 
The SDSS photometric precision is quoted at below 1% for 
guide stars used, typically V< 19. The USNO-B1.0 photo- 
metric precision is typically much worse, ~0.25 magnitudes, 
and this directly leads to an important uncertainty in line 
fluxes for objects in the MUNICS and XMM-LSS fields. Ac- 
cordingly, we have added in quadrature a 15% error, assum- 
ing the median of three guide stars per field, to the flux and 
equivalent width (EW) measurements for the MUNICS and 
XMM-LSS sources. We treat these errors as random, since 
multiple and independent sets of stars were used in different 
mosaic pointings and multiple spectrophotometric standards 
were observed. A color term was fit from the guider data con- 



sidering its non-standard, wide-bandpass filter, a new zero- 
point was calculated each month to correct for periodic equip- 
ment changes and mirror cleanings, and non-photometric ex- 
tinctions were found for each frame after removing a standard 
airmass term of 0.186 mag AM -1 . Typically, we had two to 
five stars per field that were bright enough for this purpose. 
The resultant distribution of zeropoint offsets due to trans- 
parency, Azp, is given in Figure [4] By measuring the scat- 
ter in the zeropoint offset from all the stars available in each 
frame, we find a mean uncertainty of 6% in the guider-based 
photometric correction. 

The flux calibration of IFS data was done in a manner sim- 
ilar to that for longslit spectroscopy, but with some additional 
steps to compensate for fiber sampling patter ns. We used 
the spe ctrophotometric stars and calibrations of iMassev et al.l 
(119881) observed under a six-dither pattern. Airmass ex- 
tinction coefficients for photometric conditions with a curve 
specifically modeled for McDonald Observatory are applied. 
This extinction curve is similar to the Kitt Peak curve sup- 
plied with IRAF. The bright standards allowed us to deter- 
mine both the source position relative to the fiber grid and 
the seeing Point Spread Function (PSF), which in turn yields 
the exact fiber sampling. In contrast, fainter emission-line 
sources re quire statistical sampling corrections that are dis- 
cussed in ^4.41 In order to determine the percentage of inci- 
dent flux captured over the six dither positions, we employed 
the following analysis. We began by considering the spectra 
for all fibers positioned within a large radial aperture (oper- 
ationally, 8") from the stellar centroid. and adopting a see- 
ing model with a 2D circular, Gaussian PSF. The broadband 
flux of each fiber was measured by summing over a large 
wavelength range (operationally, 4000A< A <5500A). The 
PSF and Gaussian normalization were determined through a 
nonlinear least squares minimization by assuming the spatial 
response of each fiber was tophat. The sampling correction 
was then formed from the ratio of the Gaussian normalization 
to the sum of the broadband flux measurements. Then, the 
spectral count rates of the relevant fibers were resampled to 
a common wavelength scale, co-added, and normalized using 
the sampling correction. By using such a broad, circular aper- 
ture, we ensured that the effects of atmospheric differential 
refraction on the co-added spectrum were negligible. The fi- 
nal spectral flux calibration curve was then formed from the 
ratio of the published, absolute flux density to the sampling 
corrected data count rate. Spectrophotometric standards were 
taken under a range of conditions, so their comparison re- 
quired a further correction for transparency as estimated from 
the guider measurements. Once done, we find an rms be- 
tween all flux calibration curves of 9.3% and 8.5% for FR1 
and FR2. We find no trend with wavelength in this scatter 
and so validate the assumed gray zeropoint correction for all 
guider transparencies at these levels of uncertainty. The fi- 
nal catalog will list the random line flux errors, but the whole 
sample may be considered to also be subject to the ~40% flux 
calibration systematic uncertainty just discussed. We do not 
fold the systematic into the tabulated values as relative com- 
parisons within the sample should not be subject to it. 

Several statistics from this flux calibration analysis summa- 
rize the survey's performance. First, the range of atmospheric 
transparencies for recorded data is shown in Figure |4] These 
statistics are biased against periods of weather too poor to at- 
tempt observation and represent only the best 60% by time. 
The median nonphotometric transparency penalty to this sur- 
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vey in the observable periods is 0.28 magnitudes. The to- 
tal system throughput is shown in Figure as the fraction of 
light recorded after passing through one photometric airmass 
(zenith), the telescope, the focal reducer, and the VIRUS-P 
instrument. The curves for the two focal reducers show a dra- 
matic difference: FR2 performs better than FR1 at all wave- 
lengths, but particularly in the blue where FR1 has only half 
the throughput of FR2. 

The combination of read noise, system throughput, and sky 
brightness determine the detection limit for an unresolved 
emission-line source. Figure [2] shows the 5 a limit in a detec- 
tion element (defined as ±2 x the instrumental dispersion or 
± 1 .9 x binned pixels), which is nominally the survey's photo- 
metric limit with some modulation for sources sampled under 
different fiber positions. The luminosity limit for LAEs is also 
sho wn i n Figure [2] The exact limits will be further explored 
in 8 14. 1 1 a nd c ompensated for with the completeness limit de- 
rived in i H.31 Finally, in Figure|6]we give the sensitivity maps 
at 4500A for spectrally unresolved point sources, taking into 
account mosaic overlap, bright objects, dead fibers in IFU-1, 
guider measured extinctions, and the range in airmass over 
the dataset. Small gaps in the map are due to the slightly 
different sizes of IFU-1 and IFU-2, and the failure to com- 
plete the desired six dither pattern in one COSMOS pointing 
by only completing a three dither pattern. Finally, five fields 
were chosen to overlap with previous fields for cases where 
transparency in the first pass yielded poor depth. 

2.5. Ancillary imaging 

This survey discovers and spectroscopically measures 
LAEs in one pass, as opposed to narrowband surveys that of- 
ten require spectroscopic confirmation on a subsample. The 
depth and bandpass restrictions of VIRUS-P, however, still 
make discrimination between LAEs and low-z contaminants 
challenging. For both LAEs and [Oil] emitters at many red- 
shifts, we expect to have only one strong emission line in the 
VIRUS-P bandpass. Respectively, [OIII]A5007, [OIII]A4959, 
and H/3 will be lost at z > 0.158, z > 0.170, and z > 0.193, 
and the survey's spectral resolution does not resolve the [Oil] 
doublet. Furthermore, the variation obs erved in local galaxies 
for strong line ratios dKennicuttll 19921) never guarantees that 
two statistically significant lines will be detected. By neces- 
sity, we resort to an EW cut, as used extensively in LAE nar- 
rowband surveys, to classify single emission-line detections. 
We discuss the EW cut further in §5\ However, the VIRUS-P 
spectra are not sufficiently sensitive for continuum detections 
for the majority of the emission-line detections. To reach the 
necessary limits, we must supplement the spectra with deep 
imaging. 

This dataset's fields are located in regi ons of the 
sky with existing deep images a nd catalogs dDrorv et alJ 
20011 li^mandez-Soto et all [19991 Iclipaklital] l20()il2007T: 
Ilbert et alJl2009l) . The XMM-LSS field does not have a pub- 
lished catalog but is covered by the Canada-France-Hawaii 
Telescope Legacy Survey 16 (CFHTLS) wide field Wl. The 
deep MUNICS images, which were not part of the original 

16 Based on observations obtained with MegaPrime/MegaCam, a joint 
project of CFHT and CEA/DAPNIA, at the Canada-France-Hawaii Tele- 
scope (CFHT) which is operated by the National Research Council (NRC) of 
Canada, the Institut National des Science de l'Univers of the Centre National 
de la Recherche Scientifique (CNRS) of France, and the University of Hawaii. 
This work is based in part on data products produced at TERAPIX and the 
Canadian Astronomy Data Centre as part of the Canada-France-Hawaii Tele- 
scope Legacy Survey, a collaborative project of NRC and CNRS. 



publications, consist of Bj, g', i', and z' data taken with the 
Large Area Imager for Calar Alto (LAICA) on the Calar Alto 
Observatory 3.5m, with zeropoints made by matching stel- 
lar photometry to the published catalog. Instead of using 
the literature catalogs, we have chosen to produce our own 
SExtractor catalogs on the images and error maps; this en- 
sured a consistent analysis for the fields and pushed the S/N 
to a lower threshold for a more complete emission-line asso- 
ciation. We list select properties of the relevant broadband 
data in Tabl e [2] The table also gives the Galactic extinc- 
tion values (ISchlegel et alJ [19981) we applied to the contin- 
u um and emission-l ine fluxes under the extinction curve fit 
of lO'DonnelH ( [19941) . 

Care was taken in the photometry to ensure our photo- 
metric colors were robust. Two measures of seeing FWHM 
are relevant : the one for the particular band where a Kron 
(lKronlll980h aperture is measured (FWHMif ron ) and another 
larger value to which the other photometric bands will be 
matched (FWHM matc /,). For each field, we formed a de- 
tection image by stacking the deeper available bands with- 
out matching each band's seeing (see Table [2]). The detec- 
tion parameters of SExtractor were then set to find a min- 
imum of three neighboring pixels detected with ltr signifi- 
cance over sky without filtering. Since we will only be using 
sources with 3a significance in their photometry, the exact 
detection weights and filters have little importance. Also, the 
return of spurious continuum sources from the low signifi- 
cance thresholds is acceptable for our application. A chosen 
band with good depth for each field, labeled here as i, was 
compared to the detection image using SExtractor dual image 
mode, in order to measure flux densites in a blending cor- 
rected Kron aperture, f u ^.Kron- The Kron ellipse dimensions 
a and b were also measured. Blending correction was crudely 
accomplished with the SExtractor AUTO flux measurements 
and the flag MASK_TYPE set to CORRECT. Under this setting, 
SExtractor sums the flux from the opposite side of the Kron 
aperture whenever it encounters pixels covered by multiple 
Kron apertures. In the remaining bands, labeled here as j, 
each frame was matched in seeing to FWrlM mafc ft and run 
in dual detection mode to measure the flux density in a cir- 



cular aperture of diameter 1.4xFWHM 



match: 



. The 

term f corr = (1 — e~°' 5afc / <T A'>-°™ ) then forms a correction 
factor for the fraction of flux lost to the Kron aperture from 
a point source under seeing with dispersion (J Kron- The fi- 
nal aperture-corrected flux density in each band j was then 
estimated from Equation [TJ Standard error propagation was 
applied. 



fv,i,Kron fv,j,< 



x fee 



(1) 



This resultant source catalog was used only in cross- 
correlation with our VIRUS-P emission-line catalog to iden- 
tify object counterparts. The method of assigning counter- 
parts is described in ||5] The emission line fluxes are sub- 
tracted off from the broadband measuremen ts according to 
the fil ter transmission curves as supplied by iBrammer et alJ 
d2008l) once counterparts are assigned. 

3. DATA REDUCTION 

The science goals of this survey required the develop- 
ment of a custom reduction p ipeline. Several IFS reduc- 
tion p i pelines already exi s t (e.g. | Val des 1992 :lZanichelli et alJ 
120051: iTurner et al.ll2006t ISanchezll2006t ISandin et alj|2010l) 
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and are well suited to many applic ations. In particular , we first 
tried using a predecessor of p3d (ISandin et al.ll2.Qldt iBeckerl 
120021) . The crucial limitation of the p3d package and all other 
IFS pipelines at the time, is that they resample the spectrum 
of each fiber onto a common wavelength scale early in the 
processing. This step correlates errors and complicates the 
detection statistics. In fact, we found by running simulated, 
source-less VIRUS-P data through p3d that many more res- 
olution elements were flagged to have 5a significance than 
was possible from the input Poisson statistics. The use of p3d 
would have either produced too high a contamination fraction 
or required higher S/N cuts and survey flux limits. This con- 
sideration led us to develop a set of scripts and FORTRAN 
routines collectively called Vaccine. Many of the pipeline 
steps are standard to all spectroscopic reductions. However, 
the primary Vaccine requireme nt to avoid da ta resampling is 
done in a manner similar to the lKelsonl (120031) pipeline devel- 
oped for longslit spectroscopy and affects the flat fielding and 
sky subtraction steps. 

3.1. Preliminaries 

The first operation done to each VIRUS-P frame is to mea- 
sure a single bias value from the overscan regions, subtract 
it from the frame's data section, and trim the overscan. A 
master bias then is created from all the overscan-subtracted 
biases taken during an observing run (typically 100 to 200 
frames). Overall, the noise statistics in bias frames were re- 
markably stable and indistinguishable over weeks. Next, we 
cleaned the images with a bad pixel mask made by exposing 
the camera to scattered white light and finding the pixels with 
relative quantum efficiency outside 10% of the CCD's me- 
dian. The VIRUS-P CCD has very clean cosmetics: besides 
the two rows nearest the readout register, this bad pixel mask 
only contained thirteen total pixels in three patches. Data 
combination for all co-additio ns of frames is ac complished 
using the biweight estimator (Be ers et al.l Il990h : this algo- 
rithm was chosen for its robust performance regarding outliers 
such as cosmic rays. The master bias and individual overscans 
are subtracted from all calibration, science, and flux standard 
frames. Calibration frames, consisting of arc frames and twi- 
light flats, are taken at the beginning and end of each observ- 
ing night. The dawn arcs and flats were preferentially used 
over those frames taken in dusk, as they were a better match 
to the temperature of the night-time conditions. 

As is common to both IFS and slitlet multi-object spec- 
troscopy, the traces of all fibers are not strictly parallel to the 
CCD pixels or to each other. The fiber profiles, taken from 
a flat field calibration, must be traced to define an extraction 
aperture of each fiber. Moreover, the dispersion axis is not 
necessarily parallel to each fiber's trace. However, with the 
camera alignment in VIRUS-P, we found the maximum devi- 
ation of this misalignment is 0.2 resolution elements, so we 
ignored this distinction and defined the dispersion axis along 
the fiber trace to be perpendicular to the cross-dispersion di- 
rection. This assumption effectively broadens, slightly, the 
resolution in some fibers. The tracing is then made by fitting 
Gaussian functions to cuts along the cross-dispersion axis at a 
series of wavelengths for each fiber. The Gaussian centers are 
fit by a fourth order polynomial across the CCD. This fit was 
tested against repeated flats and shown to be precise to < 0.1 
pixels across the CCD. Trace information is displayed for the 
user, who can iterate the fit tolerances if required. All fur- 
ther operations are done in the traced coordinates with cross- 
dispersion apertures of five pixels. Vaccine propagates errors 



for all operations starting with the read noise and keeps track 
of the Poisson noise from sources and the background sky. 

3.2. Wavelength calibration 

An automated peak finding algorithm is run on the arc lamp 
frames, and line identifications are made from a user entered 
initial wavelength solution. Typically, seven unblended HgCd 
lines are found with their central pixel locations determined 
by a Gaussian fit to the line profile. The pixel-to-wavelength 
mapping is then fit with a fourth order polynomial in the dis- 
persion direction. The first order term of that polynomial 
is found to vary smoothly for all fibers as a function of the 
cross-dispersion direction. Hence, or increased accuracy, this 
first order term is refit as a function of the cross-dispersion 
distance from the camera optical axis using a fourth order 
polynomial. Finally, the wavelength polynomial as a func- 
tion of dispersion direction pixel is refit, this time with the 
constrained first order term. The residuals of this procedure 
are typically one hundredth the size of a resolution element 
and the solutions are stable to a tenth of a resolution element 
over several weeks. 

The heliocentric correction is found for each frame by us- 
ing a FORTRAN implementation (written by G. Torres 17 ) of 
the IR AF task bcvcorr in the rvsao package dKurtz & Minkl 
1 19981) . The small, <1 km s _1 differences in heliocentric ve- 
locities for exposures at the same dither position but taken 
over different nights are ignored and only the mean heliocen- 
tric correction between them is applied. All reported wave- 
lengths are in the heliocentric frame. A correction to vacuum 
conditions is made assuming an index of refraction for air of 
n=1.0002 for all observed wavelengths. 

3.3. Flat fielding 

Typically fifteen twilight flats were taken each night and 
combined using the biweight estimator. To ensure high S/N 
in the twilight flats, each frame was exposed to near but below 
the CCD's 1% nonlinearity specification which occurs at 50% 
of full well. Four signals are present in the twilight flats, 1) 
the solar spectrum, 2) the relative throughputs between fibers, 
3) the fiber profile in the cross-dispersion direction, and 4) the 
relative pixel-to-pixel resp onses. To remo ve the first of these 
we employed a bspline fit dDierckxIl 19931) constrained by in- 
put from large subsets of fibers. Such a fit is robust against 
outlier datapoints (i.e. our cosmic rays or faint sources that 
fill a subset of the data) and fits curvature that a linear inter- 
polation would miss. The advantage of the bspline fit is best 
leveraged when a spectrum is highly supersampled, and the 
camera's optical distortions naturally deliver this quality in 
different fibers, predominantly as a smooth function of cross- 
dispersion direction. However, the slight (10%) spectral reso- 
lution variation across the CCD disfavors a single fit for all the 
fibers' data. As a compromise, we consider each fiber with its 
twenty nearest fibers in CCD coordinates. Within these sets 
the spectral resolution variations at any wavelength are less 
than 2%. We do not make more complicated corrections for 
the spectral resolution variation beyond this. The bspline fit 
for each fiber, serving as a model of the solar spectrum, is then 
divided into the original flat field data, resulting in a precision 
between different sets of frames to < 1 % rms. 

3.4. Background subtraction 
17 http://tdc-www.harvai'd.edu/iraf/rvsao/bcvcorr/bcv.f 
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The majority of VIRUS-P fibers and resolution elements in 
this blind survey record blank sky. This enables the noise if 
our sky model to be driven down by stacking measurements 
over many fibers, so long as the noise is statistical. By us- 
ing the 50 nearest fibers in the cross-dispersion direction, the 
statistical noise in the sky can be reduced to only 14% of a 
single fiber's noise. In this way, the uncertainty in the post- 
sky-subtracted data can be made very close to that of the pre- 
sky-subtracted data (as long as the flat-fielding systematics 
are understood). Our sky background models were formed 
identically to the flat field models. 

We note, however, that this semi-local sky estimation 
method is only robust for sources that fill a small fraction 
of the combination window, which on-sky is approximately 
Aa=100" by A<5=20". No bright, broadband sources have 
such sizes in the survey fields. Moreover, in order to further 
avoid oversubtracting bright sources, we constructed an ob- 
ject mask prior to the bspline fit. Any fibers that yield >2<r 
significance in the continuum, as estimated by combining the 
data and errors across all VIRUS-P wavelengths, were placed 
in the object mask. 

3.5. Data combination 

The count rates in the three frames taken at each dither po- 
sition were first corrected by the airmass-based photometric 
extinctions and the guider-based transparency measurements 
and then combined. The three frames and the 5 pixel cross- 
dispersion aperture delivered fifteen input values to the bi- 
weight estimator at each wavelength. The VIRUS-P flux stan- 
dard frames are passed through Vaccine exactly as the pri- 
mary science data. Finally, the scie nce spectra (and errors) are 
scaled by the flux calibration ( i!2.41 i to form a set of calibrated, 
one-dimensional spectra at each fiber and dither position. 

3.6. Systematic errors 

We identify three potential sources of systematic error in 
VIRUS-P data, one unimportant, and two that require moni- 
toring. First, we discuss why crosstalk between fibers is not 
important in VIRUS-P data. Next, we identify the effects 
of throughput variations and the accuracy of flat field cross- 
dispersion profiles on the error budget as the most prominent 
systematics. Finally, we describe an empirical, frame-specific 
estimate of the systematics that must be added to the random 
errors. 

IFS crosstalk occurs when the profile of a fiber in the cross- 
dispersion direction significantly overlaps that of other fibers 
projected nearby on the CCD. We make no crosstalk correc- 
tion in Vaccine for two reasons. First, the fibers are mea- 
sured to have cross-dispersion profiles of 4 pixels FWHM 
size. This is a factor of 2 smaller than the center-to-center 
fiber spacing on the CCD and larger than that found in many 
IFS instruments. As a result of our 5 pixel extraction aperture, 
sources of equal strength in neighboring fibers imply only a 
< 0.5% contamination. Second, the blind field selection of 
this survey leaves most fibers seeing only uniform sky back- 
ground and leaves little risk from cross-talk contamination. A 
fiber aligned on a source will usually be isolated and trade an 
equal flux from the backgrou nd s ky with its crosstalk neigh- 
bors. The flux calibration ( i]2.4| ) steps use the same cross- 
dispersion aperture, and therefore correct for the source flux 
lost by crosstalk. 

The stability in the throughput of fibers can cause signifi- 
cant systematic errors in some measurements. As discussed 



earlier, our fiber throughput is very stable, with 1 % rms vari- 
ation at worst and 0.3% median variation over a night. How- 
ever, our background sky is 25-40 x stronger than the statisti- 
cal noise limits in each resolution element. As a result, the 
systematics can overwhelm the statistical errors in spectral 
apertures of six resolution elements or more during the worst 
stability conditions. Continuum estimates using large wave- 
length ranges may thus be severely affected in our survey, and 
we make no claims on such properties. However, the situation 
for emission lines is far better. First, the systematics in a de- 
tection element (approximately two resolution elements) are 
at worst 56% of the statistical error and at median are 13% be- 
fore background subtraction. Second, most of the throughput 
variation is c aptu red in the background subtraction step. As 
described in £14.11 before we detect emission lines we subtract 
off a locally estimated continuum value using roughly ninety 
independent spectral pixels. Since fiber throughput variations 
manifest uniformly across wavelength, the spurious signal is 
a small multiple of the sky spectrum and relatively featureless 
over our bandpass (exempt for the bright [OI] 5577 A sky line 
which we mask prior to all detections). The systematic error 
in a post-background-subtraction detection element therefore 
drops to 5.9% of the statistical error in the extreme case and 
1 .4% of the statistical error in the median case. We include 
this systematic uncertainty in both the detection and flux cal- 
ibration error budgets via the empirical correction described 
below. 

The final known source of systematic error is occasional 
variability in the cross-dispersion profile that occurs with time 
and temperature for different fibers. These profile changes 
can appear as both a trace position shift and a width change, 
and while small, are important. Between twilight flats spaced 
eight hours apart and through maximum dome temperature 
changes of ten Celsius degrees, we have measured trace shifts 
of up to 0.3 pixels and profile FWHM changes of 0.3A. Our 
goal was to limit this systematic to 10% for any pixel in the 
flat. The FWHM variation already meets this criterion, but the 
maximum trace shift is too large by a factor of six. Moreover, 
although the trace shift also appears to be coherent between 
adjacent fibers on the CCD, it sometimes goes in opposite 
directions at the opposite ends of the fiber bundle, as if the 
traces are subject to a "breathing mode." We have developed 
a heuristic solution that mitigates this problem. The core idea 
is to measure the offset over subsets of fibers, alter the flat 
fields to maintain the fiber-to-fiber and pixel-to-pixel patterns 
but resample the fiber profile to produce a shifted flat tailored 
to each exposure. 

For each pre-sky-subtracted data frame, the fiber centroids 
at each wavelength along the cross-dispersion direction are 
calculated with respect to the corresponding flat. These trace 
shift estimates are then median smoothed with their twelve 
nearest fibers on the CCD. Rather than presume a cross- 
dispersion profile shape, which displays non-Gaussian fea- 
tures, we use sine interpolation to resample the profile. Lin- 
ear interpolation fails to recover the strong curvature in this 
profile. In each fiber and each wavelength, the flat field is 
resampled at the fiber-specific estimated offset relative to the 
polynomial trace peak. However, additional smoothing is still 
required to leave pixel-to-pixel features unaltered. To do this, 
we run a boxcar smoother of eighty-one pixels along the dis- 
persion direction for both the original flat field and the sine 
resampled flat field. The biweight of each forms a pure profile 
model in the original and resampled frames, and the pixel-to- 
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pixel variations are isolated in a separate image. The total, 
shifted flat is then formed by multiplying the pure, shifted 
profile model by the isolated pixel-to-pixel estimate. A final 
scaling is then applied to maintain the fiber-to-fiber through- 
puts and total flat normalization, as sine interpolation does 
not automatically conserve flux. The use of these shifted flats 
rather than the original flats results in lower systematic errors 
and meets the goal of < 10% flat field profile error. 

To capture any remaining systematics we have made a sec- 
ond, independent estimate of the error using the rms of the 
fi fteen measurements that go into the final data combination 
(E3J. This error estimate is itself noisy, but the ratio be- 
tween this empirical error and our formal error over all pixels 
is useful as a diagnostic. We find the median of this ratio per 
frame is 0-20% above the random noise alone, and the me- 
dian over all data is 5%. Therefore, we increase the errors 
by this amount prior to the detection steps. Figure Q shows 
the distribution of all 87.9 million independent datapoints di- 
vided by the error estimates of this dataset. Versions prior to 
and after continuum subtraction are shown. If the dataset were 
entirely without signal, if all the systematics were understood, 
and if all the noise were uncorrelated, the distribution should 
match the given Gaussian function with a dispersion of unity. 
Clearly the distribution is asymmetric, distorted on the posi- 
tive end by signal and the negative end presumably by the pre- 
viously discussed fiber throughput variations. However, the 
continuum subtracted data with the fiber throughput variation 
removed are much more symmetric and show a distribution 
that is a much better match to the Gaussian width. Emission 
line objects are detected in the continuum-subtracted data, and 
the noise model is validated. 

4. EMISSION LINE SOURCE SELECTION 

The controlled selection of emission-line objects is the next 
step in producing this survey's catalog. The primary task 
of the detection process is to optimally use the source sig- 
nal that has been distributed into, potentially, several fibers. 
The challenge is to push to a high completeness level at low 
S/N under a contamination constraint. The approach we adopt 
is to define emission-line detection seed apertures at a low 
S/N significance, test the combination of the seed apertures 
and all nearby fibers on sky, and allow the aperture to grow 
if the significance of the encompassed signal increases. The 
growth process is iterated. To understand the completeness 
and contamination rates of this method, we also present sim- 
ulations with mock data. In similar datasets such as blind 



longslit s pectroscopy ( Gilbank et al J 120 1 Oh and grism spec- 
troscopy dMeurer et al. 120071) . detection algorithms based on 



data convolution have been used. We have tested this ap- 
proach on our dataset, but fo und i t inferior in completeness 
to our adopted technique (see ^4.3b . 

4.1. Detection method 

Several terms require definition before we describe the de- 
tection method. A fiber position carries a set of neighboring 
fibers, defined as all other fibers offset by < 3" in their center- 
to-center coordinates. The detection aperture starts with one 
fiber and, by iteration, is allowed to grow by accepting neigh- 
boring fibers. A detection aperture may be composed of mul- 
tiple fibers and has its own set of neighbors, defined as the 
union of all neighbors to the current member fibers. The S/N 
of a potential emission line is calculated in a specific spec- 
tral window around the fit central wavelength. We define this 



detection window as spanning ±2a res where a res is the dis- 
persion of the VIRUS-P resolution element (2.2A). Within 
this window, data are summed and errors added in quadra- 
ture. Pixels that straddle the window are included by their 
fractional overlap. 

We begin with the fully calibrated spectra, errors, and fiber 
sky coordinates. First, a local continuum for each fiber is 
estimated and removed through a 200A wide biweight box- 
car. Second, seed apertures are defined as all pixels that have 
lcr positive significance under a 6A wide boxcar smoothing. 
Seeds are merged when found in the same fiber and at con- 
tiguous wavelengths. Third, a Gaussian model is fit to each 
seed with variable width, wavelength, and intensity using a 
data window of 30A. We anticipate emission-line widths for 
LAEs to lie below the VIRUS-P spectral resolution, but the 
detection method is designed to be general to all line widths. 
We experimented with basing the detection aperture on the 
Gaussian function's fit width instead of the instrument's res- 
olution, but simulations showed that the broad fits produced 
an unacceptable level of contamination. Fourth, fits with the 
seed apertures and each of the neighboring fibers are made. 
When making fits using multiple fibers, each fiber's emission- 
line intensity is allowed to vary, but constrained to a common 
wavelength and width. Fifth, if the inclusion of any prospec- 
tive neighboring fiber increases the total S/N over a particu- 
lar threshold, the fiber with the greatest increase is added to 
the detection aperture. Operationally, we use a threshold of 
AS/N — 0.3. Sixth, these steps are iterated until the aper- 
tures no longer grow or the aperture size reaches six fibers. 
The cut at six fibers is chosen because in the dither pattern, a 
point source can be equidistant from at most six fibers. Sev- 
enth, a final significance cut is made on the potential detec- 
tions. If the detections had only been made using single, inde- 
pendent apertures, simple counting statistics could be used to 
meet the < 10% contamination g oal. For example, whe n ap- 
plied to the luminosity function of lGronwall et alj d2007l) . our 
S/N>5 cut and no galactic extinction implies that we should 
see 2.4 LAEs per VIRUS-P pointing under photometric con- 
ditions. Similarly, a VIRUS-P pointing (over six dithers) has 
756k independent resolution elements, so a S/N>5 cut would 
deliver 8% contamination. Unfortunately, the more compli- 
cated detection algorithm used here is not so straightforward 
to assess. While the growth steps will recover some sources 
that would otherwise be missed, they can also bundle noise 
from neighboring fibers. We therefore have made simula- 
tions of mock noise frames in order to optimize our selection 
thresholds. 

4.2. False source tests 

To test for false sources, we began by simulating full, two- 
dimensional spectral data for twenty-five VIRUS-P fields us- 
ing the observed median sky brightness. The mock data were 
made with noise realizations from the actual sky background 
and CCD read noise but were otherwise without sources. The 
fiel ds w ere then analyzed for emission-line sources exactly as 
in i j4.1l for all detections that reached S/N > 3. The number of 
spurious sour ces were then compare d to the expected number 
of true LAEs dGronwall et al.ll20071) as a function of S/N cut, 
aperture, size, and survey depth. Evidence indicates that the 
LAE luminosity fun ction does not evo lve strongly at z — 3 
and higher redshifts (Ouchi et al. 2008), but there is less cer- 
tainty about th e rate of evolution over the lower redshif ts that 
we also probe dNilsson et al.ll2009fc ICassata et al.ll2010l) . The 
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results of this analysis are shown in Figure [8] Interestingly, 
at higher S/N the larger apertures begin to contribute the most 
contamination. Under the typical survey observing conditions 
and the majority (^80%) of source-fiber geometries, the op- 
timal number of fibers to include in a simultaneous detection 
is two. Point sour ce emission objec ts, which we anticipate 
most LAEs to be dBond et al.1 120101) . rarely (< 5%) benefit 
from fiber apertures of four or more. Conversely, the S/N for 
extended low-z objects is often improved by including more 
fibers, so we should not avoid large apertures altogether. Fi- 
nally, it is clear that a common cut of S/N > 5 would deliver 
an unacceptably high rate of (60%) contamination. The situ- 
ation can be improved by varying the S/N limit as a function 
aperture size. The choice we adopt is for an aperture of N 
fibers to have a S/N cut of S/N > 5 + 0.3 x (N - 1). Under 
the assumption of a non-evolving LAE luminosity function, 
we predict a 10%± 1 .6% contamination of spurious sources to 
the LAE sample. We project there are 17±3 spurious sources 
in the data catalog. A sample essentially free of contamina- 
tion can be produced by using this catalog with a S/N> 6 cut, 
which by the limited number statistics of these simulations 
may contain 0^q spurious sources. 

In addition we have also performed an empirical test for 
spurious sources by analyzing the inverse of the survey data 
frames. All sources with a detected continuum were masked 
(so that we would not find the inverse of absorption features 
as spurious sources), and our detection algorithm was re-run. 
This analysis found 7 spurious sources in 28 fields; a rate that 
is significantly lower than that estimated from the simulations. 
This suggests our estimate of the systematic error is conserva- 
tive and the true contamination fraction likely lies somewhere 
between 4-10%. 

4.3. Completeness tests 

Not every source at the flux limit of Figure |2] will be recov- 
ered by the detection scheme. Beyond the usual statistical 
fluctuations introduced by noise, different source positions 
and seeing variations will cause the signal to be distributed 
over a different numbers of fibers and cause varying fractions 
of light to be lost to the gaps between fibers. While this par- 
tial image sampling is an undesirable feature, IFS mitigates 
these u ncertainties compared to serendipitous longslit obser- 
vation s dRauch et al.ll2008t ILemaux et al.ll2009HCassata et al.l 
120101) . where the slit losses can range (nearly uniformly) from 
0-100%. 

We have simulated our completeness limit using 25 mock 
fields of full, two-dimensional data with noise generated from 
the mean McDonald sky spectrum and the CCD read noise. 
Each simulated image contained 3000 emission-line sources 
randomly chosen in position and wavelength, but constrained 
to avoid object blending and spaced by the seeing from the 
IFU edges. We used the same detection routines as for the 
real data. For all these simulations, the seeing was held con- 
stant at the survey's l'/5 FWHM median. These mock sources 
were modelled as spectrally unresolved poi nt sources with 
fluxes randomly drawn from an unevolving iGronwall et al.l 
(120071) LAE luminosity function over the luminosity range 
41.5 < logL(ergs _1 ) < 44.5 where the lower bound was 
chosen to yield S/N=0.5 over most of the wavelength range. 
Figure [9] compares our simulated emission-line fluxes to the 
fluxes that were measured. As the S/N decreases, the er- 
ror in our measurements increases. Moreover, at the faintest 
limits, there is a slight systematic trend, with the measured 
fluxes being over-estimated. This is the well-known Edding- 



ton ((Eddington 19131 119401) correction which, if ignored, can 
lead to an under-estimate of a luminosity function's slope. 
The least-squares fit shown in the figure will be used to statis- 
tically correct all our LAE fluxes prior to luminosity function 
computation. The completeness results are shown in Figure 
[Tol We reach 50% and 95% corrected completeness at 5.6a 
and 8.3cr respectively. Compared to a step function complete- 
ness limit at S/N > 5 at the photometric limit of this sur- 
vey which we consider the ideal goal, the number of detected 
LAEs is degraded by 13%. The long, low S/N tail helps miti- 
gate the loss of objects to the non-ideal completeness. 

Our source simulations also allow us to quantify the sta- 
tistical astrometric error as a function of S/N. This is an im- 
portant ingredient to our algorithm for associating VIRUS-P 
emission-line objects with sources found in broadband imag- 
ing (see jj3). If we adopt a Rayleigh dsibtribution for the form 
of the radial errors, i.e. a = a + b/(S/N), then a maxi- 
mum likelihood fit for the coefficients yields a = 0'.'348 and 
b = 2'.'04. Figure QT| shows this relation, with the individual 
measurements overplotted. 

The large VIRUS-P fibers lead to poor spatial resolution. 
Nevertheless, we have also simulated one mock field of 3000 
point sources at and above the survey's flux limit and seeing 
distribution in an effort to quantify the minimum resolvable 
source size. To do this, we modeled the seeing FWHM dis- 
tribution as a Gaussian function centered on l'/5 with a dis- 
persion of 1" but truncated below l'/2. With the oversampled 
pattern of dithers, we expect the Nyquist limit to be near the 
diameter size of a fiber. The s ame curve-of-growth photom- 
etry routines as described in A4.4I were used to measure the 
sizes of simulated point sources. Figure [T2l shows the distri- 
bution of emission line flux and measured size. The distribu- 
tion is mostly flat with either flux or source S/N. Based on the 
simulation, we label a threshold of 7'/5 as the resolution limit 
of our survey. This can be compared to the usual definition for 
Lya blobs, i.e. emission over an isophotal area of > 16D" at 
a certain surface brightn ess t hreshold. The Lya blob surveys 
of lMatsuda et alj ( 120041) and lYang et"afl ( 120101) used thresh- 
olds of 2.2 x 10~ 18 erg s -1 cm -2 arcsec -2 and 5 x 10 -18 erg 
s" 1 cm" 2 arcsec" 2 , respectively. Our HETDEX pilot survey 
should detect many Lya blobs based on this flux limit, but 
will only be able to resolve the very largest objects. The full 
HETDEX survey will have ~3x better spatial resolution. 

4.4. Line flux measurement 

A source's detection aperture described in Sj4j] does not 
contain the total source flux. The imposed S/N cut omits some 
fraction of the flux in the detection aperture; this fraction is a 
function of source strength and orientation to the fiber dither 
pattern. In order to determine an unbiased emission line flux 
in the presence of these complications, we describe here a 
curve-of-growth procedure used to measure a source's total 
line flux after detection. While other total flux estimators are 
possible, we advocate this method as generally robust against 
the range of sizes and morphologies encountered in the sur- 
vey and the rather large astrometric errors and seeing varia- 
tions inherent in t his dataset. Th e algorithm is similar to curve 
of growth (CoG) (IStetsonll 19901) fits previously developed for 
CCD imaging photometry, but is new to spectrophotometry. 

We begin a flux measurement by considering the positions, 
central wavelengths, and line widths (adet) obtai ned f rom the 
emission-line detection algorithm described in 8 14.11 A cir- 
cular aperture is formed around the centroid emission-line 
position of variable radius. Fibers overlapping this aperture 
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are given fractional weights determined by their enclosed ar- 
eas. Specifically, we form fifteen apertures linearly spaced 
between radii 2'.' 2 and 9'.'0. In each aperture, the enclosed 
fibers have their continuum-subtracted data summed and er- 
rors summed in quadrature for wavelengths within ±2a res of 
the detection wavelength. A spectral correction factor is de- 
fined as the flux fraction of a Gaussian line profile that falls 
within the fixed, spectral window defined by Equation [2] 

fspec,corr = erf (\[2~O res / J <T* es + 0^) (2) 

Note that the fluxes returned by directly summing all fibers 

in a circular aperture of radius r, f(r) raw , may oversample 
or undersample the source flux depending on the data com- 
pleteness and overlap regions of mosaic. For example, the 
ideal six dither pattern produces an oversampling of very near 
two. Let the number of fibers at a particular position lying 
within one fiber radius, r«6, be N(Ar < r/i(,,r, 0) in po- 
lar coordinates. Equation[3] gives the raw flux measured for 
arbitrary sampling of a source with total flux f total and nor- 
malized profile P(r, 9); f(r) samp = /J" f total P(r,6)rdrd6 
is an estimate of the cumulative flux corrected for sampling. 
This approximation is correct when N(Ar < r/«,, r, 9) does 
not systematically depend on r, which is nominally true for 
the randomly positioned observations presented here. The ap- 
proximation is necessary to cleanly estimate an unbiased flux 
without knowing the exact profile. 

f{r) raw = f f tot aiN(Ar<r fib ,r,6)xP(r,6)rdrd6K 
Jo 

/; N(Ar <r flb ,r,9)rdrd9 
jyXjsamp ^ o 

7T7" Z 

(3) 

We fit, by nonlinear least squares minimization, a cumu- 
lative two-dimensional Gaussian function, Aq d g x (1 — 
e -o.5r l<y- CoG ^ t0 t j ie highly correlated distribution f(r) samp , 
where we enforce the limits 1" < <tc g < 10". In addition, 
we create Monte Carlo realizations by varying each fiber's in- 
tensity from the best-fit model. The CoG datapoints are highly 
correlated, so we took care to estimate the errors from the un- 
corrected data of each fiber. The final, total flux estimate is 
given by Equation [4] with errors similarly propagated from 
the raw data and the uncertainty in ucoG- Figure [13] gives 
curve of growth examples for an [Oil] emitter and a LAE. 

ftotal — A-CoG / fspec,corr (4) 

We tested the reliability of the curve-of-growth flux mea- 
surement, particularly for correlated errors wi th th e source 
size, by using the simulated data discussed in $4.3\ We first 
meas ured the flux from the fibers chosen as the detection aper- 
ture ( £14. It , and compared this to the simulated flux. The mean 
and dispersion of the measured-to-simulated ratio are 0.93 
and 0.31; unsurprisingly, the fluxes are systematically under- 
estimated. Next, the set of all fibers within 6" of the detected 
position was used as the flux aperture. This reduced the scat- 
ter found by the fixed aperture method, but a systematic error 
still remained with a mean of 0.94 and dispersion of 0.20. Fi- 
nally, the curve-of-growth flux measurement was considered. 
Under this procedure, the bulk systematic flux measurement 
error vanished, giving a mean of 1 .00 while still maintaining a 
low dispersion of 0.23. All three flux estimation methods are 



shown in Figure [14] against the simulated source size. A sys- 
tematic offset with input source size can be seen for all cases, 
but the curve-of-growth photometry is preferred as the least 
biased method investigated. 

5. SOURCE CLASSIFICATION 

An emission-line galaxy catalog is of limited value without 
secure redshift identifications. Unfortunately, the uncertainty 
in identifying single emission line s is a common hin drance to 
high-redshift galaxy surveys (e.g.. lSterrT et al. 2000). We here 
describe the two steps necessary to robustly assign redshifts to 
the emission-line catalog. Tables [3]and[4]present the cata logs. 
We give the detailed description of these tables in %.2\ We 
further summarize the statistics of commonly found objects 
and compare the sample to other works where available. 

5.1. Spectral classification 

As mentioned in £|2.5I the presence of multiple, strong 
emission-lines can be used to identify some low-z objects, 
but the absence of such lines is not sufficient evidence to clas- 
sify a source as an LAE. We begin all source classifications 
by cross-correlating the primary emission line at various as- 
sumed redshifts to other bright, expected emission lines. We 
automatically search all the detection spectra for MgII2798, 
[OII]3727, H 7 4341, H/34861, [OIII]4959, and [OIII]5007 as- 
suming the detected line to be, variously, [OII]3727, H/34861, 
[OIII]4959, and [OIII]5007. At high redshift we test Lya for 
the presence of CIV 1549. We have manually tried using the 
other, commonly weaker lines as confirmation of the primary 
detection, but have found only two cases of interest. For emis- 
sion line index 4 of Tables [3] and gj the CIII]1909 line is de- 
tected with the also-significantly detected [Oil] 3727 line of 
index 5. For emission line index 85 of the same tables, the 
broad MgII2798 line is brighter than the also-significantly de- 
tected [Oil] 3727 line. We have also mis-identified index 400 
as an [Oil] emitter; it is known to be an [OIII]5007 emitter 
from the literature (Bargeret all 120081) . but we find no other 
detections at other wavelengths. 

A demonstration of this cross-correlation process for a 
multiple-emission-line source is shown in Figure [15] We find 
only two cases where the correlation against data below the 
catalog signal-to-noise cut aids classification as shown in Fig- 
ure[l6] The first is emission line index 234, which is formally 
a single emission line detection. However, we find that an 
identification of the primary line with [OIII]5007 leads to a 
S/N=3.2 detection at the wavelength of [Oil], a S/N=5.1 de- 
tection at H/3. and a S/N=3.6 detection at [OIII]4959. The 
second case is emission line index 430 which is also a single 
emission line detection. We again find that an identification 
of the primary line with [OIII]5007 leads to a S/N=3.7 de- 
tection at the wavelength of [Oil] and a S/N=2.9 detection at 
H/3. In practice, the primary utility in the emisison line cross- 
correlation is to discriminate between various low-z possibil- 
ities with high S/N detections. 

5.2. EW-based classification 

In any LAE survey at sufficient redshift, the most likely 
contaminants are [OIII]5007 and [OII]3727. Many of the for- 
mer objects can be identified by the presence of [OIII]4959 
or H/3. The latter may be identified by either splitting the 
[Oil] doublet, or by using line equivalent width as a discrim- 
inant dCowie & Hulll998l) . Since we lack the r esolution to 
split [Oil] 3727, we follow iGronwall et all ( 12007b and require 
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LAEs to have EW reS ( > 20AA number of different EW es- 
timators are possible with measurements in many filters. We 
look at two ways to estimate the EW using broadband data, 
and conclude that the cleanest selection of LAEs is obtained 
when the i?-band data is used alone. 

The observed wavelengths and EWs are shown in Figure 
[T7l Emission lines without counterparts are shown as lim- 
its. We calculate the EW first by using the nearest-available 
filter that lies redward of the entire sample. For XMM- 
LSS, GOODS-N, and COSMOS, this is the i?-band. MU- 
NICS lacks an i?-band image, so we used i'. The redward 
choice is important to avoid attentuation by the intergalac- 
tic medium (IGM) for these data and the Lyman break. Al- 
though there may be some diversity in LAE dust content 
dFinkelstein et al.l2009l) . it appears most LAEs at ourredshifts 
of interes t have only small amounts of dust and exhibit flat 
continua dGawiser et al.ll2007t iGuaita et al.ll2010t iBlanc et alJ 
l2OT0h . Of course, low redshift, star-forming galaxies may 
also exhibit flat continua or, more likely, some level of a 
Balmer/4000A break, but by extrapolating the continua from 
the i?-band, the low redshift EWs will be somewhat underes- 
timated while the LAE EWs should remain unaffected. Still, 
while such a property is beneficial to the classification pro- 
cess, an unbiased EW is also desireable for physical stud- 
ies. So, we next calculate the EWs in the right panel of Fig- 
ure[l7]by interpolating each emission-line with the two near- 
est, bounding broadband filters. Clearly, the high and low 
EW populations have more overlap in the interpolated EW 
measurements. For this reason, we adopt the i?-band EW in 
our classification scheme. Figure [T8l shows the emission line 
flux against continuum magnit ude for each emissio n line. We 
have also checked the GALEX (iMartin et al.ll2005l) GR4/GR5 
database for all objects. None of the LAE classified objects 
are GALEX sources, while most of the low-z classified ob- 
jects do have counterpart GALEX detections. 

There are nine objects for which we make exceptions: four 
low EW objects we identify as LAEs and five high EW 
sources we believe are low redshift interlopers. 

1) The lowest wavelength exception is observed at 
A=3765.6A with EW obs = as index 313. If this were 
[Oil], the galaxy would be extremely nearby (45 Mpc) away 
and ha ve M r =-10.5. The photometric redshift of lllbert et all 
(120091) suggests the line to be Lya and excludes all the low-z 
options with 95% confidence. 

2) The next low EW object is in the MUNICS field as index 
51 and has nv=23.7. The detected wavelength is 4981. 6A 
withEW obs = 61±^A.The case for this object is not terribly 
strong, but the dim continuum and lack of a GALEX detection 
suggest this to be an LAE. 

3) The third low EW object is in the GOODS-N field as 
index 447 at wavelength 5017.2A with EW obs = 81±?gA. 
It was originally lis ted as an Lyman Break Galaxy (LBG) in 
ISteidel et al.l (120031) . but no redshift measurement exists in the 
literature. The counterpart has m^=24.2. 

4) The final low EW object is in the MUNICS field as in- 
dex 92 at wavelength 5683. 3 A with EW obs = 84t^A. The 
counterpart has mi/=23.3, but no GALEX detection. Again, 
this is a borderline classification. 

Next, we consider the five high EW objects reclassified as 
being at low redshifts. 

5) The first high EW low-z object is in COSMOS as in- 
dex 289 at wavelength 5235.9A with EW obs = 96±26^- It 
does not have a GALEX detection, but the counterpart has 



m/j=23.4. The photometric redshift of dllbert et al. 2009]) sug- 
gests the line to be [Oil] and excludes all other reasonable 
options with 95% confidence. 

6) This COSMOS object is index 234 with m fl =23.4, wave- 
length 5466.7A and EW obs = 149^^. As discussed in gj 
the source shows low significance emission lines such that the 
primary detection is likely [OIII]5007. Such an identification 
is possible since unlike [QU I, [OIII]5007 can have extremely 
high EWs (lHuetalJl2009h . The source also has a GALEX 
detection. 

7) This GOODS-N object is index 356 at m R =22.S, wave- 
length 5700.5A and EW nhe = 104+^ A. It has a GALEX 
detection and has a measured redshift in IBarger et alJ (120081) 
as being from [Oil] emission. 

8) This is index 439 from the GOODS-N field with 
mfl=24.2, wavelength 5762.4A andEW obs = 119tgA.The 
object is detected with GALEX. 

9) This is index 94 from the MUNICS field with nv=21.0, 
wavelength 5768.4A and EW obs = 107tfoA. The object is 
detected with GALEX. 

We next review the likely levels of contamination in the 
LAE sample from low redshift objects based on previous 
studies. The frequency of EW in bri ght, rest-frame - optica l 



lines at low redshift has been studied in Hammer et al 



iHc 
|Gi 



Hogg et al. ( 1998), Treyer et al. ( 1998), Sullivan et al 



1997), 



(2000), 



Gallego et alJ d2002l) , and lTeplitz et all d2003l) . By combining 
the observation that ^2% of [Oil] emitters have EW reS ( > 
60A dHogg etalJll998l) with th e < z < 0.4 [Oil] lumi- 
nosity function of lSullivan et alJ d2000f) and assuming no red- 
shift evolution of eith er the [Oil] or LAE EW distributions 
dGronwall et al]l2007D . we estimate our that sample may con- 
tain 1.6 high EW [Oil] interl opers. Simila r ly, if w e use the 
local luminosity function of iGallego et al.l d2002h . we pre- 
dict zero hi gh EW interloping [ Oil] emitters. As a second 
comparison iKakazu et alJ (120071) presents narrowband imag- 
ing and limited spectroscopic follow-up in their search for 
low metallicity galaxies. They find [OIII]5007 at z = 0.63 
and z = 0.83, and [Oil] at z = 1.19 and z = 1.45 at 
high enough EW values to contaminate our sample. By com- 
paring their high EW [Oil] num ber density to t he Schechter 
dSchechterj[l976l) function fits of ILy et al.l ( 120071) at z = 1.18 
and z — 1.47 without extinction corrections, we find the high 
EW [Oil] fraction should only be 3%. In contrast, the same 
analysis suggests the high EW [OIII]5007 fraction is much 
higher (33%). However, there is no evidence for such a large 
fraction of high EW [OIII]5007 over our redshifts of inter- 
est, and the VIRUS-P bandpass will always enclose [Oil] and 
[OIII]4959 when [OIII]5007 is observed. Thus, neither high 
EW [Oil] nor [OIII]5007 emitters should form important cat- 
alog contaminants. The wavelength spacing between AGN 
lines is smaller than for [Oil] and the other optical lines, so 
AGN lines should be identifiable with multiple detections. 
For example, index 4 is our only CHI] 1909 detection, iden- 
tified with a co-detection in index 5 as CIV1549. The best 
available contamination estimate is that this LAE sample con- 
tains 0-2 contaminants from mis-identified redshifts if only 
EW information is used. A complementary question is how 
our rest-frame EW affects the selectio n of high-redshift galax - 
ies. Assuming the LAE distribution in lGronwall et al.l d2007l) . 
the answer is that ~21-26% of potential detections are lost by 
the EW cut. 

We briefly state how we propogate errors to the EW esti- 
mation. In cases where the flux density measurement is very 
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noisy, the usual first-order error propagation breaks down. 
Importantly, the error on equivalent width becomes asymmet- 
ric in the case of a low S/N continuum even if the original 
errors on flux and flux density are symmetric. One simple so- 
lution is to treat the maximum liklihood distributions in flux 
and flux density as Gaussian functions, transform the flux into 
EW and flux density, and define the EW errors using the ex- 
trema of the 68% confidence interval. Similarly, in the case of 
asymmetric errors for the line fluxes, we use the same equa- 
tion evaluated with each one-sided error to arrive at final EW 
limits. When we find no upper limit, we list the upper uncer- 
tainty as lOOOA. 

5.3. Counterpart association 

The coarse spatial resolution of our VIRUS-P survey of- 
ten prevents us from associating with certainty a given emis- 
sion line to a unique broadband counterpart. However, strin- 
gent redshift probabilities can often be made by marginaliz- 
ing over all possible counterparts and their implied rest-frame 
emission-line equivalent widths. We quantify this associa- 
tion probability by using the astrometric error, discussed in 
^2.31 and the differential number counts for the i?-band im- 
ages. Since MUNICS lacks i?-band data, we use the i'-band 
there. The exact band choice for this step is not critical, so 
long as the filter samples a fairly flat spectral region for both 
low-z and high-z objects. We describe the method for i?-band 
continuum association as it applies to equivalent-width-based 
redshift discrimination. We use th e sam e formalism for AGN 
association through X-ray data in ^5.41 

The problem of assigning counterpart probabilities to detec- 
tions in mu ltiple bandpasses has been exp lored by Bayesian 
methods in I Sutherland & Saunders! ( 119921) and is com monly 
implemented in X-ray surveys (e.g. iLuo et al.l 120101) . We 
choose not to use the Bayesian technique here since it re- 
quires assuming a prior on the continuum counterparts to the 
emission-line detections. We instead make a simpler, frequen- 
tist estimate that still uses the information from multiple can- 
didates. The only assumed inputs are the astrometric error and 
the number counts of background and foreground objects. 

The probability of an emission line being associated with 
any one image-based counterpart can be constructed as the 
joint probability of all the remaining imaging detections be- 
ing unassociated and drawn from established number counts 
and the preferred counterpart having the observed offset eval- 
uated against the astrometric error budget. For simplicity, we 
treat all the individual probabilities as independent; this sim- 
plication is justified since the range of distances in our redshift 
range is much larger than cross-correlation scales between 
galaxies. We begin by identifying all the significant imag- 
ing detections within some large area of the detected emission 
line. We then define: i as the set of all imaging detections in 
the survey field, Art as the angular offset between the posi- 
tion of the emission line and the centroid of counterpart i, Si 
as the flux density of counterpart i (or X-ray flux in a defined 
bandpass), cr, as the astrometric error for the emission line 
under consideration and counterpart i, and n(S) as the differ- 
ential number count of galaxies in the observed bandpass. We 
begin by assembling the set of imaging detections with cardi- 
nality j as Cj = (j G i : Arj < 10"). The exact value of 
the angular limit is not important so long as it is several times 
the astrometric error. The chance of a superposition by one 
or more imaging detections without them being actual coun- 
terparts is then P nc j = 1 — /(0, A) where f(n, A) = ^-4 — 



is the Poisson probability distribution and A is the expecta- 
tion value for the number of galaxies brighter than Sj within 
Arj, so A = nAr 2 n(S)dS. Alternatively, the detection 

j may be the true counterpart. If we model the astrometric er- 
ror as a two-dimensional Gaussian distribution in the astrom- 
etry error and take its cumulative evaluation from infinity, the 
chance of measuring the true counterpart at Arj or further is 

-Ar 2 

P c j = cxp( 2 -i 3 ). It may also be that we have not measured 
the true imaging counterpart, either due to imaging depth or 
the emission-line detection being spurious, in which case all 
imaging detections must be explained as superpositions. 

We give in Equation[5]the full joint probabilities assembled 
from the individual probabilities just described, under the as- 
sumption that either one or none of the imaging detections is 
the true counterpart to the emission-line detection. A simi- 
lar calculation i s do ne to evaluate the significance of X-ray 
counterparts in ^5.41 

Pnc,k 

l<fc<3 

n Puck + j2 Pc - k x n Pnc ' m 

l<k<j 1< k<j \ l<m<j,mjik 

: no counterpart 



P = < 



Pc.k ^nc,m 



\ Pnc,k+ 
l<k<j 1< k<j 

: counterpart k 



Pc,k X Pnc,m 
l<m<j,m^/c 



(5) 

In the case of imaging, the astrometric error is dominated by 
the positional uncertainty of the emission-lines, but in the case 
of X-ray data the positional uncertainty of both the emission- 
line and X-ray detections are comparable and important. The 
normalization is simply chosen to make the probabilities sum 
to unity. 

In order to match i?-band objects, we performed a least 
squares m inimization fit to the R-band differential number 
counts of iFurusawa et al.l (120081) with a double power law 
function to estimate n(S) as given in Equation[6] 

5142 x (/./lO- 28 )- 1 " 6 
: f v > 7.81 x 10- 30 erg/s/cm 2 /Hz 

(6) 

10882.6 x {U/10- 28 )- 1 - 702 
: f v < 7.81 x 10- 30 erg/s/cm 2 /Hz 



n(/„)[pern°] = 



In practice, we consider a threshold distance of 1" when 
calculating the expected number counts of sources based on 
common seeing conditions to avoid the claim of total coun- 
terpart certainty at Arj =0 regardless of other counterpart 
options, although the exact threshold makes little difference. 
Finally, in defining Arj, we take the radial offset from the 
emission-line centroid to the nearest position contained in the 
imaging's Kron aperture instead of the Kron aperture center. 
This is motivated by the fact that [Oil] emission in nearby 
galaxies may be from HII regions located at large galactocen- 
tric radii. 

Object classification starts by identifying all imaging cata- 
log counterparts within 10" from the emission-line centroid. 
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The association probability for each possible counterpart is 
calculated using Equati on [5 ] The emission line is classified 
under the EW rule of ^5.21 to be at either low (most likely 
as [Oil]) or high (most likely as Lya) redshift. In 74% of 
the cases, the best counterpart probability exceeds 90%; we 
refer to these objects as the isolated sample. In another 3% 
of the cases, our analysis is most consistent with there be- 
ing no broadband counterpart; we classify these sources as 
LAEs since all the image depths imply FWj^y^ rest ^ ^0A. 
For the remaining cases, our classification is less certain due 
to there being multiple likely counterparts; nevertheless the 
most probable association is always presented. We illu strate 
ste p-by-st ep two representative classificati on cas es in i]5.5.1| 
and !5.5.2l and a case with less certainty in i j5.5.3l 

We confirm the proper classification of many low red- 
shift objects by observing multiple emission lines. In total, 
there are 118 emission-line sources with one or more asso- 
ciated emission lines in combinations of [Oil], Hi5, H7, H/3, 
[OIII]4959, and [OIII]5007. Of these, all are classified auto- 
matically by our EW cut as being at low redshift. 

5.4. AGN contamination 

We attempt to identify the Lya sources that are AGN 
through existing X-ray data. All our survey fields have ei- 
ther Chandra or XMM/Newton coverage, although to quite 
no n-uniform depths . We use the p oint-s ource catalogs 
of lElvis et all J2009I); ICappelluti et al.l d2009l) in COSMO S, 
lAlexander et al.l d2003b in GOODS-N , iPierreet alj (120041) in 
XMM-LSS, and I Watson etail J2009T) in MUNICS. T h e data 
covering MUNICS is described in ISevergnini et al.l d2005l) 
but not cataloged. The same methodology for determining 
broadband imaging counterparts is applied to the X-ray data. 
The cataloged X-ray spatial uncertainty is added in quadra- 
ture to the emission-li ne spatial uncertainty, and the fit of 
ICappelluti etaD (120071) from 2-10 keV is used for the differ- 
ential number count. Unlike with the imaging counterparts, 
the association of an X-ray source with a VIRUS-P emission 
line is nearly binary in nature: there is either a single coun- 
terpart or no counterpart. Table [5] summarizes our results by 
listing the fraction of X-ray sources in the low and high- z ob- 
jects. We find 6-8% contamination of LAEs by AGN over 
all the fields with the range depending on what fraction of 
the line detections we attribute to noise. AGN contamina- 
tion is likely a strong function of flux limit, b ut we compare 
briefly to other, deeper survey s. The sample of iGawiser et alj 
(120071) : iGronwall et ai] (120071) at z = 3.1 contains 1% AG N 
contamination, the sample of iNilsson et al.l (I2009L 120101) at 
z = 2.3 contains 6-15% AGN contamination, and the sam- 
ple of iGuaita etail ( 120101) at z = 2.1 contains 5% AGN con- 
tamination. These numbers, all utilizing X-ray detections of 
AGN, are consistent with the value we find. However, other 
work with mid-IR and far-IR AGN identification has poten- 
tially shown a much high er AGN fraction of 75% at z = 2.2 
dBongiovanni et al1l2010l) . We do not perform any mid-IR or 
far-IR AGN analysis here. There is no significant variation 
between AGN fractions of GOODS-N with the deeper X-ray 
data and the COSMOS field. The small number statistics and 
shallower X-ray data in MUNICS and XMM-LSS explain the 
lack of AGN detections in those fields. 

There are two other potential indicators of AGN activ- 
ity: broad Lya emission-line widths (which may also be 
seen in Lyman-a "blobs" where AGN activity is not evident 
dFrancis et alj|2001b Hower et al.ll2004t iMatsuda et al J 120061; 



ISmith & JarvTsll2007l: ISaito et al.ll2008l) ). and the presence of 
CIV 1549 over a fraction of the redshift range. The distri- 
butions of line widths, without any deblending of the [Oil] 
doublet, is given in Figure [19] From the distributions, it is 
clear that line width information does not aid object classifi- 
cation. We find two cases where broad line objects (FWHM 
line widths > 500 km s" 1 ) have been classifed as Lya without 
any X-ray detections, but none at > 1000 km s _1 . However, 
only one Lya and CIV 1549 source (indices 461 and 462) was 
not detected in X-ray. 

5.5. Example sources 

The rules to classify the emission-line objects have been de- 
scribed, but the display of the steps on actual VIRUS-P data 
is useful to establish confidence and the range of objects en- 
countered. We will walk through the evidence for one emis- 
sion source of each type and then give a summary display of 
representative subsamples. 

5.5.1. LAEs 

The detection image of Figure[20]shows source index 229 in 
Tables [3] and 2] as a broadened, bright emission line detected 
in four fiber positions. The high flux (41. x 10 _17 erg 
s" 1 cm" 2 ) and the lack of a spectral continuum detection are 
already sufficient to meet the classification cut as an LAE. 
However, classification from the spectrum alone is only pos- 
sible for the brightest emission-line sources in this sample. 
There are n o cou nterpart emission lines at any of the tested 
transitions ( 35. It . nor is there any associated X-ray detection. 
We next move to the deep, COSMOS R image in Figure |2"T1 
There, we find three plausible broadband counterparts with 
the brightest counterpart dominating the likelihood. There is 
no literature redshift for this object, but our Lya line identifi- 
cation leads to EWo=51^gA, so the object is classified as an 
LAE. We give compact detection images for five additional 
LAEs in Figure l22l 

5.5.2. Low-z objects 

The detection image in Figure [23] shows source index 308 
as a high S/N, only slightly and not significantly broadened, 
emission-line source detected jointly in four fiber positions. 
The line flux is 18.4l|;2 X 10" 17 erg s" 1 cm" 2 , and the lack 
of a spectral continuum is insufficient for our EW-based clas- 
sification scheme. There are no other emission lines detected 
in the object nor does the source have a X-ray counterpart. 
The COSMOS image (Figure [24]) shows one bright contin- 
uum source barely offset from the emission-line centroid. A 
second, fainter object at larger separation is also analyzed at a 
much decreased likelihood, but would carry an LAE classifi- 
cation. Based on the most probable counterpart, we find a rest 
frame EW, assuming the line to be Lya, of EWo=8+ 2 A. This 
fails the EW cut, so we classify this as a low redshift object, 
presumably an [Oil] emitter. The actual rest frame EW is then 
EWo=25lg A. We note that association with the other possible 
counterpart would lead to the opposite conclusion. However, 
the likelihood of that association is quite low, P=0.1%, so we 
confidently classify the source as an [Oil] emitter. We give 
compact detection images for five additional low redshift ob- 
jects in Figure |251 

5.5.3. Objects with uncertain redshifts 
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The detection image in Figure [26] shows source index 322 
as a high S/N, broadened line along with a continuum de- 
tection in three fiber positions. The emission line flux is 
16.7tt.3 x lCT 17 erg s" 1 cm" 2 . The COSMOS image (Fig- 
ure |27jl reveals three plausible counterparts. The most likely 
(84%) counterpart implies an easy classification as an [Oil] 
emitter with EW (, S = 37lgAHowever, there is a non-trivial 
likelihood (8%) that the counterpart not the bright galaxy, but 
instead the fainter object. In this case, the source would be 
classified as an LAE. Despite this uncertainty, we place this 
object in Tables[4]with a [Oil] classification. 

6. EMISSION LINE SOURCE CATALOG 
6.1. GOODS-N comparisons 

Most of the detections and redshift classifications in this 
catalog are new to the literature. For instance in the COS- 
MOS field, the magnitude li mit of the spect roscopic cut 
(l AB <22.5) to zCOSMOS dLillv et al.l 1200! gives little 
overlap with this sample. Fortunately, the large number of 
deep spectroscopic observations in the GOODS-N field com- 
prise a better test sample. We have m ade a detailed com - 
parison of our measurements to those of iBarger et al.l (120081) . 
which includes most previous GOODS-N me asured redshifts. 
We fu rther include one Lya match fr om lLowenthal et al] 
(fl997h and one [OIII]5007 source from lWirthet al.l (12004 . 
We note that the observations from the literature often have 
larger spectral coverage and higher resolution than our data, 
allowing alternate classification methods. 

We find 119 unique emission-line sources in GOODS-N. 
Three of these do not have measured optical broadband coun- 
terparts from the ground-based imaging, appear to be blended 
with foreground objects when exa mined with the Hubbl e 
Space Telescope (HST) images of iGiavalisco et all (120041) . 
and are without published redshifts. We classify these as 
LAEs. In addition, there are nine other LAEs where we do 
measure a robust continuum counterpart but that are with- 
out published redshifts. We give the twelve new LAEs in 
GOODS-N in Figures[28]and[29] In addition, we find 92 low- 
z objects in common to the literature and 13 high-z objects 
(12 Lya and 1 [CIV]). Finally, we find only two ojects in our 
catalog were mis-classified. Source index 371 was originally 
called an LAE, but the literature reveals it to be a CIV 1549 
emitter. Source index 400 was originally called an [Oil] emit- 
ter, but the literature reveals it to be an [OIII]5007 emitter. 
We have rectified Table 0] to reflect these two cases, and we 
then find an rms in of 0.001 and no offset compared to 

the literature, which is completely consistent with our 0.5A 
line center uncertainty. A weakness in the literature samples 
is the lack of emission-line flux calibration, so we cannot use 
the previous samples to quantitatively test this survey's com- 
pleteness. We have qualitatively confirmed the completeness 
by searching for literature objects in our spectra and finding 
many dozen at 3 < S/N < 5. 

6.2. Catalog summary 

Table [3] contains a segment of the detected emission-line 
catalog with the full version available electronically. The en- 
try is given where there is not an applicable value. Each 
emission-line is prefixed with the identifier "HPS" to stand 
for HETDEX Pilot Survey. The column descriptions are: (1) 
the catalog number, (2) the emission-line right ascension in 
hrs:min:sec (J2000), (3) the emission-line declination 
(J2000), (4) the observed emission wavelength in vacuum 



air (A) (with an estimated 0.5A uncertainty based on sim- 
ulations), (5) the intrinsic FWHM of the emission line (km 
s _1 ) after removal of a 5 A FWHM instrumental resolution 
and (with an estimated 300 km s _1 uncertainty), (6) the S/N 
of the emission-line flux detected within the aperture set of 
fibers, (7) the emission-line flux and error in 10 _ er g s - 1 
cm~ 2 as measured with the curve of growth method ( i]4.4l ), 
(8) the spatial FWHM of the emi ssio n line (") as measured 
with the curve of growth method ( £|4.41 i, and (9) any additional 
entries in the table that share a position and redshift with the 
emission line (i.e. as with detections of other emission lines 
from the same source). 

Table |4] shows a segment of the counterpart and classifica- 
tion information for each emission-line detection with the full 
version available electronically. The entry is given where 
there is not an applicable value. The column descriptions are: 
(1) the catalog number, (2) the best continuum-selected coun- 
terpart in the standard J2000 naming convention, (3) the R- 
band magnitude for this best counterpart (or the i' magnitude 
for MUNICS), (4) the probability of counterpart association 
(from Equation [5]), (5) the rest-frame EW and uncertainties 
for this counterpart and the selected transition based on the 
i?-band photometry, or the i'-band in MUNICS where no R- 
band is available (A), (6) the rest-frame EW and uncertainties 
for this counterpart and the selected transition based on an in- 
terpolation between the two nearest filters (A), (7) the transi- 
tion of the emission line based on the EW cut and/or the pres- 
ence of multiple emission lines, (8) the estimated redshift, (9) 
the probability of the emission line being Lya as calculated 
by marginalizing over all potential counterparts, and (10) the 
X-ray counterpart in the standard J2000 naming convention. 

6.3. Catalog properties 

Figure [30] compares the distribution of rest frame EW for 
LAEs and [Oil] emitters to the emission-line luminosity. His- 
tograms of the rest frame EW distributions of both low and 
high redshift sources are shown in Figure [31] A maximum 
likelihood fit was made by taking EW reS ( > 20A where the 
samples should be complete. An exponential scale length of 
128±20A fits the LAE distribution and 22± 1 .6A fits the [Oil] 
distribution. The redshift distribution of all sources is given 
in Figure [32] N o previously identifie d groups or clusters lie 
in our fields (e.g. lKoester et al.ll2007al) . In Figure[33] we give 
the color-color diagram for the sample's LAEs. We do not try 
to transform the filter systems into filter sets which are usu- 
ally applied to LBG and BX galaxy samples, but we do plot 
the loc ation of the LAE spectral temp late from lGawiser et all 
(120071) . made from iMarastor] (120051) stellar population syn- 
thesis modeling, o ver the relevant redshifts. We also show the 
locus of stars from lPicklesI (119981) . Many of the LAEs appear 
consistent with color space expectations based on continuum- 
selected samples. 

6.3.1. Spatially extended high-z sources 

Based on the detection threshold of i j4.3l we find five 
objects whose Lya emission appears significantly extended. 
Figure [34] gives the detection and broadband images for the 
objects, and Figure [35] shows the curve-of-growth analysis 
which determines their sizes. These five objects are indices 
99 and 126 in the MUNICS field and indices 162, 164, and 
261 in the COSMOS field. Index 99 is also a high EW object. 
Indices 162 and 261 are high EW if one use the i?-band con- 
tinua for the EW estimation, and they are both X-ray sources. 
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6.3.2. High EWLAEs 

LAEs with EW res t > 240A are potential sites of exotic 
energy sources or unusual metallicity since stellar popula- 
tion modeling has shown that a norma l initial mass functio n 
(IMF) cannot produce such high EWs dCharlot & Fallll 19931) . 
If we consider our whole catalog and use the EW measure- 
ments derived from interpolating with the two nearest filters, 
we find 1 1 LAEs without broadband counterparts and a fur- 
ther 21 LAEs with counterparts that have EW reS ( > 240 A at 
> la significance. However, in order to make a conserva- 
tive estimate, we instead use EW estimates based on the R- 
band photometry only and restrict the discussion to sources 
with emission-line detection S/N>6.5 to avoid false detec- 
tions. This instead leaves only 1 LAE without a counterpart 
and 2 LAEs with counterparts meeting the high EW crite- 
rion. We note that a number of the emission lines without 
broadband counterparts may have their origins obscured by 
ground-based seeing. For instance, three of the objects with 
new redshifts in GOODS-N are shown as part of Figures [28] 
and|29] For homogeneity between all fields, we only measure 
continua from the ground-based images in this work. Some 
of the entries in Table [4] as being without counterparts may 
be caused by blending and not image depth. In fact, the only 
emission-line detection with high confidence as being without 
a counterpart is index 314. 

Figure [36] shows the detection and image data for the three 
significant high EW LAEs. The top figures show the data 
for index 314, an LAE with z= 2.6312 but with no counter- 
part in the COSMOS image and EW rest > 348A (la). The 
middle figures show the data for index 126 in MUNICS. Al- 
though the counterpart is fairly bright at 77V = 24.3, the very 
bright emission line implies EW reS ( > 352A (la). We note 
that this z= 2.8276 object is also significantly extended in 
Lya with FWHM=7'.'5. Finally, the bottom figures show the 
data for index 231 in COSMOS with EW rest > 282A(la) 
and z= 2.7215. This object is marginally extended in Lya 
(FWHM=6'/3), but also compatible with a point source and 
poor seeing. We find a 3% high-EW fraction in the LAE sam- 
ple by our best estimates. However, the fraction could be as 
high as 31% by our most inclusive criteria. 

6.3.3. LAE number density expectation 

The spectral and spatial s ensit ivity limits along with the 
completeness simulation of 8 14.31 completely define the sur- 
vey's selection characteristics and are necessary inputs to the 
lumi nosity function calc ulations that will follow in future pa- 
pers dBlanc et alJ |2010|) . By considering all these effects, 
namely, the completeness di stribu tion we are able to ac hiev e 
with the detection routine ( ^4. lb and simula ted data ( j[43]l, 
and fi nally the LAE luminosity function of iGronwall et alJ 
(12007b . we predict this sample should contain 121 LAEs. 
The dominant uncertainty in this prediction is cosmic vari- 
ance, which can be approximated by linear t heory for a given 
redshi ft, volume, and number density from ISomerville et alJ 
J2001 Fi gure 3. The effective Lya survey volume in 
IGronwall etail ( 120071) is 1.1 x 10 5 hf 3 Mpc 3 which implies 
a relative cosmic variance of a v ^35% and a v ~15% for the 
volume of this survey. Within these factors, the LAE number 
statistics from this survey are low but not in serious conflict 
with earlier determinations. 

7. SUMMARY 



We present untargeted integral field spectroscopic observa- 
tions over 169 □' with the goal of characterizing emission-line 
galaxies at low (z < 0.56) and high (z ^ 2) redshifts. In this 
first of a series of papers, we describe the design, observa- 
tions, calibrations, reductions, detections, measurements, and 
classification methods for the survey. The primary classifica- 
tion method we employ uses equivalent width cut computed 
by matching the emission-line objects to continuum counter- 
parts in existing, deep images. We find that effective object 
classification can be made using EW reS ( > 20A where the 
continuum is defined using a single band of deep photometry, 
preferably in the i?-band. We find 397 unique emission-line 
galaxies: 168 over a 71.56 □' area in the COSMOS field, 118 
over a 35.52 □' area in the GOODS-N field, 79 over a 49.85 
□' area in the MUNICS field, and 32 over a 12.30 □' area in 
the XMM-LSS field. The two transitions most frequently ob- 
served are [Oil] ( 285 galaxies) and Lya ( 104 galaxies). Based 
on a non-evolving lGronwall et ail ([2007) luminosity function, 
we should have detected 121 LAEs in this survey; the differ- 
ence is within the range of cosmic variance. The field with 
the deepest X-ray data (GOODS-N) shows an AGN fraction 
in the LAE sample consistent with that of the shallower fields 
(6%). We compare our data to the extensive GOODS-N tar- 
geted spectroscopy to verify our object classification and con- 
firm 92 low-z and 13 high-z galaxies. Moreover, we derive 
new redshifts for a further 2 low-z and 12 high-z galaxies in 
the GOODS-N field. Over all fields, eleven high-z objects do 
not possess optical counterparts despite the imaging depth; 
these are either very high EW objects, contamination from 
noise, or objects whose counterparts have been blended by 
ground-based seeing. However, within the remaining LAEs 
we find a distribution of EW that can be described by an expo- 
nential scale length of 128±20A and with only three objects 
atEWo > 240Aat >1 a significance. Many of the newly dis- 
covered LAEs lie in the color ranges consistent with previous 
work. 

The main contaminant in our LAE sample is simply noise, 
which should be 10% of the LAE sample based on simula- 
tions. A totally pure subsample of 68 LAEs can be defined 
using this catalog at S/N > 6. We find five sources of Lya 
emission that have a high significance as being spatially re- 
solved, at least two of which are AGN. The pilot survey has 
validated that IFS searches for LAEs perform as expected. 
The forthcoming larger FOV of the full HETDEX survey will 
vastly improve the survey efficiency of this method. 
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APPENDIX 

ATMOSPHERIC DIFFERENTIAL REFRACTION ASTROMETRY CORRECTION 

Atmospheric differential refraction (ADR) effects over this dataset's wavelength and airmass ranges are of the order of the 
astrometric solution errors, so we have made an astrometry correction to the guider-based positions considering the emission- 
line source wavelength. There are two ADR effects on the observed fiber positions at any given wavelength: the atmosphere's 
wavelength-dependent index of refraction at a fixed airmass and the different airmasses between the science and guider FOVs. 
As we have stated in $2.11 the guider's effective wavelength is 5000A and we ignore color corrections for different guide stars. 
In order to retain the ability to stack exp osures taken at the sam e dither position, we average the positional differences over the N 
exposures and apply Equation I A 1 1 from Smart & Greenl ( 11977b as the average positional corrections for an emission-line source 
at wavelength A due to ADR where <fi is the site latitude, 5 is the declination, 6 g is the distance angle between the gui der and IFU 
cente rs, H is the hour angle at the middle o f the frame's exposure, and k is the constant of mean refraction calculated dFilippenkol 
CHI for average 2 km altitude conditions (lAllenl 19731) and related to the atmosphere's index of refraction. Common corrections 
derived this way are 0-2" with a median of 0'.'3 for our sample. 

_y> fc(A)sec 2 (5sin.ff fc(5000l) sec 2 (<5 + 9 g ) siaH 

a ~ ^ N x (tan S tan <j> + cosH) N x (tan(<5 + 9 g ) tanc/> + cosH) 

^ k(X) x (tan0 — tan (5 cos fl) fc(5000A) x (tan<^> — tan(<5 + 6 g ) cos If) 

~~ ^ N x (tan S tan 4> + cos H) N x (tan(<5 + 9 g ) tan<£ + cos if) 
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Figure 1. The layout of a VIRUS-P observation and the quality of a guider-based astrometric solution. Bottom left: The footprints of the spectroscopic science 
FOV and the northernly offset guider FOV overlayed on a Digital Sky Survey (DSS) image of the open cluster M67. This type of field is used to calibrate the 
astrometry of the guider and the fibers as discussed in £|2. 31 Top left: An expanded view of the VIRUS-P guider field with residuals from the astrometric model. 
The residuals are shown as red vectors scaled by 60 X . The rms is (X'42. Bottom right: An expanded view of the science FOV. The continuum map is generated 
from the IFS data summed over 4100A< A <5700A. Fibers that have significant flux and border other significant fibers are highlighted with green circles and 
bunched as point source detections for the astrometric fit. The residuals are shown as red vectors scaled by 60 X. The rms is $'21. The residuals in the IFU are 
less than the residuals in the guider as both fields have a similar number of degrees of freedom, but the guider has more datapoints. Top right: The expanded view 
of one fiber moving through the six dither positions. The pattern, marked with D1-D6, gives very nearly an oversampling of two. The small offsets necessary to 
complete the dither patterns are controlled by sending offsets to the guider. 




Figure 2. Left The 5cr detection limit under photometric conditions for an emission-line object perfectly centered in a fiber in three 20 minute exposures. 
Different source positions can improve or decrease this limit by ~15% which is captured in our completeness calculation. In both figures, curves are given for 
the two focal reducers, FR1 and FR2. Right The 5cr luminosity limit under photometric conditions for objects detected in the Lya line. 
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Figure 3. Left: The positional o ffset s from catalog emission-line detection positions and broadband counterpart image positions divided into two S/N bins. The 
residual correction discussed in i]2.3l has been applied. The lcr astrometric error budget for each bin is also drawn with radii 0'.'95 for 5 < S/N < 10 and C//84 
for 10 < S/N < 25. Right: A histogram of the same data shown with a Rayleigh distribution. The same dispersions are used to demonstrate the appropriate 
characterization of the astrometric error as a two-dimensional Gaussian function. 
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Figure 4. The distribution of zeropoint offsets due to non-photometric transparency as measured with the guider camera. This distribution represents the best 
60™ percentile of the observing allocation with the remaining 40% being too poor to guide or requiring dome closure. 
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Figure 5. The total system throughput of VIRUS-P, the 2.7m telescope, and the atmosphere at an airmass of one. Curves are given for the two eras of focal 
reducers, FR1 and FR2. 
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Figure 6. Sensitivity maps (la per detection element) at 4500A in 10 erg s cm . The three broken fibers in fiber bundle IFU-1 are evident. Top left: 
COSMOS, Top right: GOODS-N, Bottom left: MUNICS, Bottom right: XMM-LSS. 




Figure 7. A histogram of the ratio between the reduced data and error for all 87. 8M indepedent spectral elements in this survey. Most elements only see the 
sky background with residuals consistent to a normally distribution noise model. A small but obvious fraction of the elements also see bright, positive signal 
from continuum sources. Th e no rmalized Gaussian function shows the expected distribution in the absence of signal and systematics. The influence of the 
fiber throughput systematic ( i|3.6t likely broadens the distribution. This is most evident on the negative side which becomes nearer the normal distribution after 
background subtraction. The distribution after continuum subtraction appears much more symmetric and with a better matching width. The data wings at high 
and low ends represent the fiber positions with strong continuum. The boxcar-based continuum fitting is a rather crude tool that does not characterize all the 
continuum signal, but the emission line catalog is uncompromised by its use. 
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Figure 8. The f ractional contamination to the LAE sample by S/N cut. The detection method used is described in detail in El4.ll The LAE predictions are the 
same as in CE4l Curves are given for differently sized collections of neighboring fibers. The growth of apertures was allowed whenever the S/N was increased 
by > 0.3 with the inclusion of another fiber. At a high, constant S/N, the greatest contamination is produced by large apertures. The optimum sized aperture for 
a point source under all dither-source geometries is two at median. To optimize our selection, we make a staggered series of S/N cuts based on the number of 
fibers used, N, as S/N> 5.0 + 0.3 X (TV — 1). The horizontal and vertical lines show the evaluation points of this cut to the simulation curves. This procedure 
predicts 10%±1.6% noise contamination. 




+ + + # + 



15 

(S/N) measlJred 



Figure 9. The ratio of measured and input fluxes from point source simulation and a least-squares fit ( i]4.3t . The curve-of-growth method is used here to measure 
emission line flux. The upturn at low S/N is expected from the Eddington bias. This ratio can be applied to the LAE flux measurements of Table [3] for some 
applications. The average correction is fit as flux measured /flux true =0.98+0.74/(S/N) measured +4.27/(S/N)^ easured 
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Figure 10. The survey's catalog completeness function for multiples of the flux limit. When combined, this function, the survey's photometric wavelength-based 
flux limit in Figure [2] and th e actual mosaic pattern with photometric calibration in Figure [6] define the limits necessary for volume and luminosity function 
calculation I'Blanc et al. 2010). 
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Figure 11. The distribution of simulated source positional errors under VIRUS-P sampling patterns with a fit to the dispersion as a function of measured S/N. 
The maximum liklihood fit to the peak of a Rayleigh distribution gives a = 0'.'348 + 2'.'04/(S'/7V). 
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Figure 12. The distribution of measured source sizes in simulated data. The simulation input source sizes were drawn from the survey seeing distribution. The 
large fiber sizes set a large resolution threshold. Based on the 99.7% confidence interval, we only claim significantly resolved measurements for curve-of-growth 
FWHM sizes > 6.81". 
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Figure 13. The data, fits, and final evaluation for the curve of growth line flux measurements of a LAE (top) and an [Oil] emitter (bottom). In both, the open 
square symbols display the cumulative sampling factor on the right hand scale. The sampling factor is written in Equation|3]as N(Ar < r/jbi r 'i 8) an d i- s the 
average number of fibers overlaying the surface enclosed in radius r. The points with errors show the estimated cumulative flux on the left hand scale. The 
vertical dotted lines mark where the fit is truncated. This threshold has been selected with consideration towards being significantly larger than the widest objects 
found and small enough to limit unnecessary noise. The vertical solid line marks the radius to the fit where 90% of the flux in enclosed. The horizontal solid 
lines show the Icr confidence of the fit's normalization. The top fit returns a total flux of 24^|'q x 10~ 17 erg s _1 cm~ 2 . The bottom fit returns a total flux 

of 19±™ x 10" 17 erg s 1 cm 2 . The errors are correlated on the displayed scale, but the Monte Carlo fit varies the data from each fiber independently to 
generate proper errors. 
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Figure 14. Binned ratios of measured and input fluxes for simulated data under a range of source sizes. The curve-of-growth flux estimator is preferred as the 
least biased for extended sources compared to either the detection set of fibers or a fixed radius set of fibers as the photometry aperture. 
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Figure 15. The attempts to find matching lines to an [OIII]50()7 detection in index 323 of Table [5] The detection is formed from two fibers represented 
independently with the red and green lines. The leftmost column shows the prospective identifications of our originally detected line at 5619A. For each 
prospective identification, we attempt fits to the emission-line possibilities in the top row. In this case, matches to [Oil] and H/3 both give a clear identification. 
The [OIII]4959 is detected but overlapping with the mask around 5577A. This technique only rarely gives positive evidence for Lya classification by ruling out 
low— z emission line combinations because our wavelength bandpass is not much larger than common bright optical emission-line spacings. However, is often 
useful in classifying transitions between low redshift options as in this case. The full spectrum is shown in the large window to the left. Various continuum 
regions are evaluated by assuming the primary emission line to be Lya and [Oil] shown by the horizontal lines. The continuum fits are used to look for various 
breaks as calculated in the upper right. This galaxy is also identified by significant detections of H/3, [OIII]4959, and [Oil] in entries 325, 326, and 327 of Table 
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Figure 16. The automated search for multiple emission line in index 430. The format is the same as in Figure |15| This is one of the few cases where the 
consideration of marginally significant counterpart lines aids the classification. The primary detection is revealed to be [OIII]50()7, with marginal detections in 
[Oil] and H/3 that did not make the primary emission line catalog. 




Figure 17. The distribution of observed wave leng th and observed EW. The line marks the usual EW cut used in narrowband imaging and adopted here. 
Exceptions to the EW selection are discussed in &I5.2I Left Continuum estimated only from the i?-band photometry (or the i'-band in MUNICS). Right Continuum 
estimated from interpolation with the two nearest filters bounding each emission line's wavelength. 
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Figure 18. The distribution of emission line flux and broadband magnitude. Left Continuum estimated only from the i?-band photometry (or the i'-band in 
MUNICS). Right Continuum estimated from interpolation with the two nearest filters bounding each emission line's wavelength. 
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Figure 19. The line width distributions for all survey objects. We have subtracted an instrumental resolution of 300 km s _1 in quadrature. There is significant 
overlap between all populations making width-based classification impossible. No attempts have been made here to deconvolve the blended [Oil] doublet. The 
low-z objects are generally contained to low widths, but the LAE distribution overlaps heavily with both the AGN and low-z distributions. 
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Figure 20. The VIRUS-P emission-line detection image for index 229 which lies in the COSMOS field. This object is undoubtably a real detection with one of 
the largest S/N ratios we find for any high-z object. In this case, the aperture is grown to include four fibers. The four rows in the Figure's bottom half show 
the spectra from the four detection fibers. The right side, square cut-outs show the spectra from individual 20-minute exposures and the sky model. The three 
exposures in each fiber are then biweight combined into the two-dimensional, bottom-left spectra and the one-dimensional spectra in the upper-left line plot. The 
collapsed, one-dimensional spectra are color-coded by fiber number. The Gaussian fits to each fiber are given by dotted curves. For visual clarity, the spectra 
are resampled and stacked into the black histogram. Continuum is not detected within a 200A boxcar around the line, and the high level of flux permits the Lya 
classification from the spectrum alone in this rare case. The tabulated EW instead is based on the flux density of the imaging counterpart. The quoted central 
wavelength in this figure has not yet had the heliocentric and vacuum corrections applied as is done with the tabulated values. 




Figure 21. The ground-b ased , Vj imaging cutout for the Table[3]index 229. The four color circles represent fiber positions and are color coded in accordance 
with the spectra of Figure l20l The black circle indicates the emission line centroid. The Kron apertures from the imaging catalog are drawn as green, numbered 
ellipses. The best centered and brightest source is #10087 with an association likelihood of 87% and r + =23.4. The next two most likely counterparts are #10085 
at 6% and r + =25.1 and #10087 at 5% and r + =24.9. Either of these would also make the LAE EW cut, and it is possible that one or more LAEs at similar 
redshifts are jointly contributing to the emission-line flux, but this is unlikely. 
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Figure 22. Five example detections of LAEs with the same formats as Figures 1201 and 1211 The first two lie in the COSMOS field, the next two lie in the 
GOODS-N field with the first redshift previously measured and the second new, and the final one lies in the MUNICS field. The entries from Table |3]for these 
five are indicies 223, 160, 341, 402, and 62. The best continuum counterpart matches are to #'s 5939, 93336, 24971, 26897, and 11161. 
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Figure 23. The emission-line source detection for index 308 which lies in the COSMOS field. The format is the same as in Figure l20l The spectrum-based EW 
does not go deep enough to discriminate between the classifications. This source neither shows alternate emission lines nor has an X-ray counterpart. 




Figure 24. The ground-based, Vj imaging cutout for the Table[5]index 308. The detection spectra are given in Figure l23l The format is the same as in Figure [2l1 
The counterpart #10818 at r + =22.1 has a 99.5% likelihood of being associated. The observed EW of 30.9^tg 4 g 9 A leads to a firm low-2 classification, presumably 
for [Oil]. 
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Figure 25. Five example detections of low-z objects with the format of Figure [23] The first and fifth lie in the COSMOS field, the second and third lie in the 
GOODS-N field both with previously measured redshifts, and the fourth lies in the XMM-LSS field. The entries from Table[5]for these five are indicies 178, 351, 
406, 33, and 192. The best continuum counterpart matches are to #'s 28595, 23390, 19645, 5150, and 32863. 
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Figure 26. The emission-line sou rce detection for index 322 which lies in the COSMOS field. The spectrum-based EW suggests a low-z classification. The 
format is the same as in Figure l20l Note that the emission line is broadened compared to the instrumental resolution. 




Figure 27. The ground-based, Vj imaging cutout for the Table[5]index 322. The detection spectra are given in Figure l26l The format is the same as in Figure |2"T1 
The counterpart #66311 at r+ =2 1 . 1 has a 84% likelihood in association. The counterpart #66312 is already assigned to the VIRUS-P detection of Table [3] index 
310 at A k s = 4948. 2A. The counterpart #66310 at r+=24.9 would be a LAE based on EW and looks like a reasonable candidate system resolved from #6631 1 
in the HST image, but it only holds a 6% chance of association. This source is associated to #663 1 1 as an [Oil] emitter in the catalog. 
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Figure 28. The spectral detection and HST ACS F606W (Giavalisco et al. 2004) cutouts for six of the twelve new, high-z redshift measurements in GOODS-N. 
From top to bottom, the objects are indicies 334, 338, 360, 372, 373, and 403 in Table [4] The first and fifth objects do not have identified counterparts in the 
ground-based images due to blending, although likely counterparts are identified in the HST data. The best counterparts for the second, third, fourth, and sixth 
objects are #'s 32321, 28992, 26519, and 19646. 
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Figure 29. The spectral detection and HST ACS F606W (Giavalisco et al. 2004) cutouts for the final six of twelve new, high-z redshift measurements in 
GOODS-N. From top to bottom, the objects are indicies 415, 426, 434, 447, 467, and 474 in Table|4] The object with index 467 does not have an identified 
counterpart in the ground-based images due to blending, although a likely counterpart is identified in the HST data. The best counterparts for the remaining five, 
in order of listing, are 22030, 19592, 23908, 23670, and 28797. 
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Figure 30. The distribution of rest-frame equivalent width and line luminosity for both the LAE and low-z samples. The primary classification line based on 
EW is drawn. The jog in the EW cut line is simply due to the cut being defined as EW res t > 20A assuming the line to be Lya so the equivalent threshold in the 
[Oil] restframe is 6lA. The drawn EW cut doesn't strictly apply to the low-z objects with emission at transitions other than [Oil]. A trend between higher EW 
and line lu minosity in the LAEs is somewhat visible but noisy over this survey's dynamic range. The same trend is seen in surveys with lower flux limits and 
discussed in Cassata et al. (2010). Left Continuum estimated only from the ij-band photometry (or the i'-band in MUNICS). Right Continuum estimated from 
interpolation with the two nearest filters bounding each emission line's wavelength. 
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Figure 31. The distribution of rest frame equivalent width for values with S/N>3 for both the LAE and low-z sample by interpolating from the bounding 
broadband data. The LAE histogram has been scaled by 10 X for visual clarity. A maximum likelihood fit was made by taking EW reS ( > 20A where the samples 
should be complete. An exponential scale length of 128±20A fits the LAE distribution and 22±1.6A fits the [Oil] distribution. The exponential fits and error 
ranges are also plotted. The largest plotted bins contain all values that lie higher than the histogram range. 
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Figure 32. The distribution in redshift for all the survey objects. The upper axis labels apply to the low-z sample and the lower axis labels to the LAE sample. 
Each significantly overdense low-z bin is primarily contained in one surve y field. The overdensi ties may be early indicators of groups or clusters, but the low 
number statistics preclude firm classification. There are no clusters from the Koester et al. (2007b) catalog in any of this pilot survey's area. 



Table 1 

Summary of VIRUS-P Observations 



Date 


Number of 


Fields Median 


Range of 


Number of 




pointings 


A 


y used 


Ay used 


emission line 










detections 


2007 Oct 04-09 


3 


MUNICS S2 


0.29 


0.18-0.44 


21 




2007 Nov 05-10 


5 


MUNICS S2;COSMOS 


0.33 


0.00-1.48 


26 




2007 Dec 04-09 


6 


MUNICS S2;COSMOS 


0.24 


0.23-0.42 


36 




2008 Jan 03-10 


X 


MUNICS S2;COSMOS 


0.25 


0.00-2.84 


71 




2008 Feb 01-12 


6 


MUNICS S2;COSMOS;GOODS-N 


0.28 


0.00-1.63 


27 




2008 Apr 01-07 


3 


COSMOS ;GOODS-N 


0.30 


0.00-0.97 


30 




2008 Apr 28-May 03 


3 


GOODS-N 


0.39 


0.00-2.77 


43 




2008 Jun 03-09 


2 


GOODS-N 


0.31 


0.00-1.69 







2008 Sep 24-29 


2 


XMM-LSS 


0.36 


0.19-0.51 


28 




2008 Nov 24-30 


2 


COSMOS 


0.13 


0.07-0.46 


8 




2008 Dec 22-27 


4 


COSMOS;XMM-LSS 


0.23 


0.13-0.47 


40 




2009 Jan 21-27 


3 


COSMOS;XMM-LSS 


0.23 


0.13-0.67 


46 




2009 Feb 19-23 


4 


COSMOS ;GOODS-N 


0.19 


0.00-0.53 


27 




2009 Mar 20-25 


4 


COSMOS ;GOODS-N 


0.34 


0.00-0.92 


29 




2009 Apr 20-25 


2 


GOODS-N 


0.20 


0.07-0.48 


28 




2009 May 20-25 


1 


GOODS-N 


0.21 


0.00-1.55 


5 




2010 Feb 09-11 


1 


COSMOS 


0.30 


0.10-0.50 











Table 2 














Ancillary broadband imaging properties 










Field Central a 


Central 8 


E(B-V) Filters FWHM 


Stack 


Band k 


Depth 


VIRUS-P 


(J2000) 


(J2000) 




t 


t 




area (□') 


COSMOS 10:00:30 


+02:15:04 


0.018 u*,B/,Vj,i',i',z' 1.33 


B/r'i' 


Vj 


26.5 


71.6 


GOODS-N 12:36:51 


+62:12:51 


0.012 Uj,Bj,Vj,Rj,Ij,z' 1.26 


BjRj 


[ v. 7 


26.6 


35.5 


MUNICS-S2 03:06:41 


+00:01:15 


0.083 Bj,g',i',z' 0.99 


Bjg'i 


g' 


25.8 


49.9 


XMM-LSS 02:21:20 


-04:30:00 


0.027 u*g',r',i',z' 0.97 


g'r'i' 


g' 


25.8 


12.3 



* The worst seeing FWHM in " to which all bands are matched, 
f Filters combined to form the detection image. 
$ The band chosen for Kron aperture measurement. 

** The 5(j limit in AB magnitudes for a point source in a 1" aperture for the band with the Kron aperture measurement. 
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Figure 33. Color-color plots in the photometry bands that commonly define the Lyman Break Galaxy selection. Detections are not shown for the MUNICS 
field where we lack U -band data. The LBG selection rules are sensit ive to the exact filter and telescope choice, so we do not transform these filter data into 
systems with p ublished LBG rules. Instead, we synthesize colors of the Gawiser et al. ( 2007) LAE template as the solid, black curve for 1.3 < z < 4.5 and stars 
IPicklesll 998l). Albeit with some exceptions and frequently large color errors, the LAE sample is segregated from the low-z objects and lies where expected. Top 
XMM-LSS objects, Middle COSMOS objects, and Bottom GOODS-N objects. 
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Figure 34. The five objects significantly extended in hya. Left The spectral detections. Right The R-band images (i' for the first two MUNICS objects). First 
Index 99 in MUNICS. The best counterpart is #1 1720. Second Index 126 in MUNICS. The best counterpart is #11315. This source is also a high EW LAE. Third 
Index 162 in COSMOS. The best counterpart is #27975. This source has an X-ray detection. Fourth Index 164 in COSMOS. The best counterpart is #23399. 
Fifth Index 261 in COSMOS. The best counterpart is #68206. This source has an X-ray detection. 
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Figure 35. The curve-of-growth plots for the five significant objects extended in Lya. The format is the same as in Figure [731 and described therein. Top 
Left Index 99 in MUNICS. The best counterpart is #11720. Top Right Index 126 in MUNICS. The best counterpart is #11315. This source is also a high EW 
LAE. Middle Left Index 162 in COSMOS. The best counterpart is #27975. This source has an X-ray detection. Middle Right Index 164 in COSMOS. The best 
counterpart is #23399. Bottom Index 261 in COSMOS. The best counterpart is #68206. This source has an X-ray detection. 
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Figure 36. The three high-significance LAEs with EWq >240A. Left The spectral detection figures. Right The i?-band images. Top Index 314 in the COSMOS 
field. No continuum counterpart is found. Middle Index 126 in the MUNICS field. The best counterpart is listed as #11815. The emission line is also significantly 
spatially extended. Bottom Index 23 1 in the COSMOS field. The best counterpart is listed as #64872. 
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Table 3 

HETDEX Pilot Survey Emission Line Catalog (Abridged) 



llr o index 


a 




.5 




r W JrilVl 


S/N det 




(J2000) 


(J2000) 


(A) 


I Kill S ) 


( 1 ) 


(2) 




(3) 


(4) 


(5) 


(6) 


001 


02:21 


11.16 


-04:31:25.0 


5219.16 


229 


8.1 


002 


02:21 


12.21 


-04:32:25.3 


5448.72 


307 


5.6 


003 


02:21 


14.28 


-04:31:38.2 


4973.93 


422 


7.5 


004 


02:21 


14.86 


-04:31:56.6 


5261.37 


1285 


6.3 


005 


02:21 


15.14 


-04:31:54.0 


4270.67 


1841 


33.1 


006 


02:21 


16.26 


-04:29:32.8 


4591.58 


399 


14.8 


007 


02:21 


16.35 


-04:31:14.6 


5161.72 


293 


19.5 


008 


02:21 


17.25 


-04:27:55.7 


5820.13 


1 18 


6.7 


009 


02:21 


17.25 


-04:30:10.4 


5464.33 


78 


12.1 


010 


02:21 


17.47 


-04:27:30.6 


4808.33 


357 


15.9 


Oil 


02:21 


18.48 


-04:27:32.2 


4590.82 


441 


6.6 


012 


02:21 


19.20 


-04:30:28.9 


4398.59 


344 


9.9 


013 


02:21 


19.22 


-04:30:39.9 


5250.96 


378 


7.8 


014 


02:21 


19.47 


-04:28:21.9 


4863.62 


328 


21.8 


01 5 


02:21 


19.48 


-04-29-34 


4919 31 




21 7 


016 


02:21 


19.83 


-04:28:08.8 


4901.07 


397 


5.9 


017 


02:21 


19.85 


-04:27:43.4 


4588.13 


93 


5.9 


018 


02:21 


20.77 


-04:30:56.6 


5655.44 


972 


8.2 


019 


02:21 


20.78 


-04-30-56 4 


5452.07 


438 


26.8 


020 


02:21 


21.27 


-04:27:44.6 


5788.83 


303 


8.9 


021 


02:21 


22.13 


-04:30:35.0 


5448.33 


277 


8.6 


022 


02:21 


22.84 


-04:29:25.2 


4586.30 


603 


7.4 


023 


02:21 


22.88 


-04:29:44.3 


4540.36 


323 


9.4 


024 


02:21 


23.28 


-04:29:22.3 


4592.57 


204 


8.8 


033 


02:21:26.83 


-04:30:05.8 


5460.20 


343 


9.4 


051 


03:06:34.61 


-00:00:49.6 


4980.74 


428 


5.2 


062 


03:06:38.12 


+00:00:40.0 


3741.83 


392 


5.5 


085 


03:06:41.48 


+00:01:10.8 


3699.08 


2323 


7.5 


092 


03:06:44.44 


+00:01:46.7 


5682.19 


363 


6.0 


094 


03:06:46.23 


+00:02:18.3 


5767.11 


370 


22.7 


099 


03:06:48.36 


+00:00:17.0 


4868.35 


505 


8.3 


126 


03:06:55.26 


+00:00:33.8 


4653.11 


815 


14.2 


160 


10:00:08.61 


+02 


17:38.6 


4175.32 


663 


6.0 


162 


10:00:08.73 


+02 


15:32.0 


4167.83 


1063 


7.3 


164 


10:00:08.95 


+02 


17:23.2 


4196.31 


482 


6.6 


178 


10:00:11.39 


+02 


15:14.1 


5484.87 


283 


5.8 


192 


10:00:13.57 


+02 


13:16.6 


4893.46 


406 


9.0 


223 


10:00:18.56 


+02 


14:59.8 


4018.61 


1102 


5.7 


229 


10:00:19.39 


+02 


13:12.6 


4910.43 


625 


14.2 


231 


10:00:20.80 


+02 


19:18.8 


4524.17 


551 


7.0 


234 


10:00:21.49 


+02 


13:51.5 


5465.91 


78 


14.1 


261 


10:00:28.57 


+02 


17:48.4 


3763.70 


886 


9.8 


289 


10:00:35.24 


+02 


18:07.3 


5235.12 


428 


6.4 


308 


10:00:40.04 


+02 


12:51.7 


5622.45 


157 


7.5 


310 


10:00:40.70 


+02 


17:41.5 


4948.20 


532 


7.7 


313 


10:00:40.78 


+02 


18:23.6 


3765.58 


249 


5.9 


314 


10:00:41.09 


+02 


17:03.6 


4414.31 


352 


6.8 


322 


10:00:44.57 


+02 


18:31.6 


5057.65 


486 


7.5 


323 


10:00:45.12 


+02 


18:22.0 


5617.75 


213 


15.3 


341 


12:36:17.52 


+62 


13:10.0 


4778.04 


298 


9.3 


351 


12:36:23.05 


+62 


13:45.0 


5530.63 


352 


6.0 


356 


12:36:29.24 


+62 


11:53.4 


5698.97 


315 


9.0 


371 


12:36:35.34 


+62 


14:23.4 


4660.93 


957 


6.0 


400 


12:36:46.37 


+62 


14:08.4 


5651.77 


76 


7.2 


402 


12:36:46.86 


+62 


12:27.4 


4821.05 


630 


8.3 


406 


12:36:47.53 


+62 


15:14.3 


5715.47 


209 


7.4 


430 


12:36:52.03 


+62 


11:25.9 


5452.19 


79 


5.9 


439 


12:36:56.87 


+62 


11:52.0 


5760.95 


232 


6.5 


447 


12:36:59.37 


+62 


13:42.6 


5016.05 


318 


5.8 



Flux 
(10- 17 
(7) 



cgs) 



Spatial 
FWHM (") 

(8) 



Matching 
indices 
(9) 



42.6 1 



32.7 3 



17-4 3 _t 9 

12- 23:13 
19.91'L 

cll.2 
-12.4 
-J4? 1 165 
- 5 ' tz - 1 _14.3 
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-3.6 

49.44 4 

14.1^| 5 

38-9 3 4. 6 
21-21-L 
26-4 4 4. 9 

49-6 2 4. 8 
36.2 2 j| 8 

13- 7 5 _|.i 
13.2 3 _!.4 

45-4^.5 
83.6 4 j| 

21-84.9 
8-84.9 

1"4.3 

18.54 ! 

14- 34.5 



TC 43.6 

z3 -^-4.6 
9 15.0 

67 7 22 - 9 
D '-'-21.0 

288.343.8 

13.24.3 
134.794! 4 

33.04 3 

167-14L 
17 I 10 - 5 

/D -^-11.5 
25 4 13 - 7 

10.14.6 
23.64. 9 
39.04 5 4 
41 6 42 
30.84 6 

24- 44.0 
143.74-4 

17- 14.5 

18- 44.2 
10 7 2 9 

25-44.1 
14.44.9 
16-74 3 
36.74; 8 

12- 8j 2 .s 

IT. 713.8 
-'-'■ '-12.1 

25- 64.1 

13- 94.5 
5 9 1 - 7 

16-84 2 
lO-e^a 

6-64.9 

8-44.6 
7.2 2 -i , 



47O.8 

4 -'-0.6 



4 T.0.8 
4.44 

5.1° 



4.6 U 



-0.5 

,0.7 
-0.9 
4 8 - 2 
fi0.4 
-0.2 
47O.2 
^■'-0.2 
6 6 9 

3 6 5 

4 6°- 2 

5 9°- 7 
3y -0.7 

5 7 - 6 
3 -'-0.5 

3 6 - 7 
- , -°-0.5 

5 5°- 2 

4 q0.2 
4y -0.3 

5 3 1 - 2 
-'•-'-1.2 

5 3 11 
-'■-'-0.9 

5 6°- 4 

30 - o.6 

5 6 1 
3 - D -0.1 

5 I - 8 

31 - o.6 

3 3°- 7 
-'■-'-0.8 

5 2°- 8 
J - z -0.9 

4 6°- 7 



0.5 



4.0° 



5 5 

3U -o.6 

3 5 1 - 4 
J - ; '-1.7 



3.4 1 



ll.3 
-1.0 

3 5 - 7 

441.0 

6 2 

7 4°- 7 
'■ 4 -0.6 

7 5°- 5 
'■-'-0.3 

5 2 1 - 5 

8-34.9 

11-04.3 

4 R as 

<; 4O.6 
34 -0.6 

6 4 10 
°- 4 -0.9 

5 4O.5 
34 -0.3 

6 3 11 

D -'-0.7 

4-54.4 

8.34.6 

6 5 1 - 2 
a3 -1.3 
5 4O.8 
3>f -0.9 

4 7 
4U -0.6 

5 1 - 8 
3 - u -1.3 

3 7 1 - 1 
-'■'-0.7 

4.74 



5.1 L 



-0.8 
,0.4 
-0.3 

3-84.6 
7.1 1 - 5 



7.1 J 



-1.4 
,1.0 
-0.7 

3 - '-1.4 

2 5 1 - 
z - 3 -1.4 
-t qO.5 
- ,y -0.5 
4 1O.8 

4 il-l 
<k J -1.2 

4 2 1 - 2 

4 - Z -0.9 
4 4O.9 
4 - 4 -0.9 



19 
18 



86 
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Note. — Tablets published in its entirely in the electronic edition of the ApJS. A portion is shown here to display its form and content. 
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Table 4 

HETDEX Pilot Survey Emission Line Classifications (Abridged) 



HPS Index 


Counter- 


Counter- 


Counter- 


FWn t * 

EVV R,rest 


FW- » 

interp.rest 


Trans- 


2 est 


Lya 


X-ra 


(1) 


part 

(2) 


part niR* 
(3) 


part P 
(4) 


(A) 
(5) 


(A) 
(6) 


ition 
(7) 


(8) 


P 

(9) 


counterpart 
(10) 


001 


J022 11 12-043 126 


23.05 


0.93 


51 q 14 - 7 
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Note. — Tablcf-flis published in its entirety in the electronic edition of the ApJS. A portion is shown here to display its form and content. 
* The Johnson or SDSS R band filters used are listed in Table(2] The MUNICS fldd - al a ~3 hours, instead uses an SDSS i filter. 
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Table 5 

Emission line/X-ray counterpart statistics 



Field 


Low-z counterparts 


High-z counterparts 


Depth 

t 


COSMOS 


2/112 


4/55 


0.73 


GOODS-N 


27/94 


2/25 


0.14 


XMM-LSS 


1/24 


0/8 


~27 


MUNICS 


4/63 


0/16 


-20 



(Counterparts/tola! emission lines) 
' Assuming a point source, 10 — ^° ergs - ^ cm - ^ in 2-10 keV, if available, or 2-8 keV. The depth for 
XMM-LSS varies over the observed regions, and the sensitivity map for the X-ray coverage in MUNICS is 
not published. 



